SEQUENCE LISTING 



<110> Barnett, Susan 
Zur Megede, Jan 



<120> POLYNUCLEOTIDES ENCODING ANTIGENIC HIV TYPE C 
POLYPEPTIDES, POLYPEPTIDES AND USES THEREOF 



<130> PP01631.101 



<140> 09/475,704 
<141> 1999-12-30 



<150> 09/610,313 
<151> 2000-07-05 

<160> 46 

<170> PatentIn Ver. 2.0



<210> 1 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 



<400> 1 

gacatcaagc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 2 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 2 

gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 60 

<210> 3 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
of HIV strain AF110965 



<400> 3 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 120
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240
accgtggcca ccctgtactg cgtgcacgag aagatcgagg tccgcgacac caaggaggcc 300
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 360
gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 420
gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 480
gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 540
gacctgaaca cgatgttgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600
gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660
atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 720
ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780
aagcggtgga tcatcctggg cctgaacaag atcgtgcgga tgtacagccc cgtgagcatc 840
ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900
accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960
gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ctctcggccc cggcgccagc 1020
ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080
ctggccgagg cgatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140
ggcccccggc gcatcgtcaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380
ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440
ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479



<210> 4 
<211> 1509 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic Gag
of HIV strain AF110967 



<400> 4 

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga gaagatccgc 60
ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120
ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180
atgaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgtacaac 240
accgtggcca ccctgtactg cgtgcacgcc ggcatcgagg tccgcgacac caaggaggcc 300
ctggacaaga tcgaggagga gcagaacaag tcccagcaga agacccagca ggccaaggag 360
gccgacggca aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420
caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480
ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540
ctgaacacga tgttgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600
accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc cggccccgtg 660
gcccccggcc agatgcgcga cccccgcggc agcgacatcg ccggcgccac cagcaccctg 720
caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga catctacaag 780
cggtggatca tcctgggcct gaacaagatc gtgcggatgt acagccccgt gagcatcctg 840
gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900
ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac cctgctggtg 960
cagaacgcca accccgactg caagaccatc ctgcgcgctc tcggccccgg cgccaccctg 1020
gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080
gccgaggcga tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 1140
ggcccccggc gcaacgtcaa gtgcttcaac tgcggcaagg agggccacat cgccaagaac 1200
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320
cccggcaact tcctgcagaa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 1380
cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440
cgcgagccct accgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500
agccagtaa 1509



<210> 5 
<211> 141 
<212> DNA 

<213> Artificial Sequence 






<220> 

<223> Description of Artificial Sequence: Env common 
region of HIV strain AF110968 

<400> 5 

accatcacca tcacctgccg catcaagcag atcatcaaca tgtggcagaa ggtgggccgc 60 
gccatgtacg ccccccccat cgccggcaac ctgacctgcg agagcaacat caccggcctg 120 
ctgctgaccc gcgacggcgg c 141 



<210> 6 
<211> 1431 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
gp120 coding region of HIV strain AF110968



<400> 6 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg c 1431



<210> 7 
<211> 1944 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
gp140 coding region of HIV strain AF110968

<400> 7 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1500
atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560
ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 1620
cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680
atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740
agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800
atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860
aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 1920
atcaccaact ggctgtggta catc 1944



<210> 8 
<211> 2466 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
gp160 coding region of HIV strain AF110968



<400> 8 

agcgtggtgg gcaacctgtg ggtgaccgtg tactacggcg tgcccgtgtg gaaggaggcc 60
aagaccaccc tgttctgcac cagcgacgcc aaggcctacg agaccgaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcgt gctggagaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc tgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgcc gcaacgtgaa cgccaccaac aacatcaaca gcatgatcga caacagcaac 360
aagggcgaga tgaagaactg cagcttcaac gtgaccaccg agctgcgcga ccgcaagcag 420
gaggtgcacg ccctgttcta ccgcctggac gtggtgcccc tgcagggcaa caacagcaac 480
gagtaccgcc tgatcaactg caacaccagc gccatcaccc aggcctgccc caaggtgagc 540
ttcgacccca tccccatcca ctactgcacc cccgccggct acgccatcct gaagtgcaac 600
aaccagacct tcaacggcac cggcccctgc aacaacgtga gcagcgtgca gtgcgcccac 660
ggcatcaagc ccgtggtgag cacccagctg ctgctgaacg gcagcctggc caagggcgag 720
atcatcatcc gcagcgagaa cctggccaac aacgccaaga tcatcatcgt gcagctgaac 780
aagcccgtga agatcgtgtg cgtgcgcccc aacaacaaca cccgcaagag cgtgcgcatc 840
ggccccggcc agaccttcta cgccaccggc gagatcatcg gcgacatccg ccaggcctac 900
tgcatcatca acaagaccga gtggaacagc accctgcagg gcgtgagcaa gaagctggag 960
gagcacttca gcaagaaggc catcaagttc gagcccagca gcggcggcga cctggagatc 1020
accacccaca gcttcaactg ccgcggcgag ttcttctact gcgacaccag ccagctgttc 1080
aacagcacct acagccccag cttcaacggc accgagaaca agctgaacgg caccatcacc 1140
atcacctgcc gcatcaagca gatcatcaac atgtggcaga aggtgggccg cgccatgtac 1200
gcccccccca tcgccggcaa cctgacctgc gagagcaaca tcaccggcct gctgctgacc 1260
cgcgacggcg gcaagaccgg ccccaacgac accgagatct tccgccccgg cggcggcgac 1320
atgcgcgaca actggcgcaa cgagctgtac aagtacaagg tggtggagat caagcccctg 1380
ggcgtggccc ccaccgaggc caagcgccgc gtggtggagc gcgagaagcg cgccgtgggc 1440
atcggcgccg tgttcctggg cttcctgggc gccgccggca gcaccatggg cgccgccagc 1500
atcaccctga ccgtgcaggc ccgcctgctg ctgagcggca tcgtgcagca gcagaacaac 1560
ctgctgcgcg ccatcgaggc ccagcagcac ctgctgcagc tgaccgtgtg gggcatcaag 1620
cagctgcaga cccgcatcct ggccgtggag cgctacctga aggaccagca gctgctgggc 1680
atctggggct gcagcggcaa gctgatctgc accaccgccg tgccctggaa cagcagctgg 1740
agcaaccgca gccacgacga gatctgggac aacatgacct ggatgcagtg ggaccgcgag 1800
atcaacaact acaccgacac catctaccgc ctgctggagg agagccagaa ccagcaggag 1860
aagaacgaga aggacctgct ggccctggac agctggcaga acctgtggaa ctggttcagc 1920
atcaccaact ggctgtggta catcaagatc ttcatcatga tcgtgggcgg cctgatcggc 1980
ctgcgcatca tcttcgccgt gctgagcatc gtgaaccgcg tgcgccaggg ctacagcccc 2040
ctgcccttcc agaccctgac ccccaacccc cgcgagcccg accgcctggg ccgcatcgag 2100
gaggagggcg gcgagcagga ccgcggccgc agcatccgcc tggtgagcgg cttcctggcc 2160
ctggcctggg acgacctgcg cagcctgtgc ctgttcagct accaccgcct gcgcgacttc 2220
atcctgatcg ccgcccgcgt gctggagctg ctgggccagc gcggctggga ggccctgaag 2280
tacctgggca gcctggtgca gtactggggc ctggagctga agaagagcgc catcagcctg 2340
ctggacacca tcgccatcgc cgtggccgag ggcaccgacc gcatcatcga gttcatccag 2400
cgcatctgcc gcgccatccg caacatcccc cgccgcatcc gccagggctt cgaggccgcc 2460
ctgcag 2466



<210> 9 
<211> 2547 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence and gp160 coding region of HIV
strain AF110968 



<400> 9 

atgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60
ttctggatgc tgatcatcag cagcgtggtg ggcaacctgt gggtgaccgt gtactacggc 120
gtgcccgtgt ggaaggaggc caagaccacc ctgttctgca ccagcgacgc caaggcctac 180
gagaccgagg tgcacaacgt gtgggccacc cacgcctgcg tgcccaccga ccccaacccc 240
caggagatcg tgctggagaa cgtgaccgag aacttcaaca tgtggaagaa cgacatggtg 300
gaccagatgc acgaggacat catcagcctg tgggaccaga gcctgaagcc ctgcgtgaag 360
ctgacccccc tgtgcgtgac cctgaagtgc cgcaacgtga acgccaccaa caacatcaac 420
agcatgatcg acaacagcaa caagggcgag atgaagaact gcagcttcaa cgtgaccacc 480
gagctgcgcg accgcaagca ggaggtgcac gccctgttct accgcctgga cgtggtgccc 540
ctgcagggca acaacagcaa cgagtaccgc ctgatcaact gcaacaccag cgccatcacc 600
caggcctgcc ccaaggtgag cttcgacccc atccccatcc actactgcac ccccgccggc 660
tacgccatcc tgaagtgcaa caaccagacc ttcaacggca ccggcccctg caacaacgtg 720
agcagcgtgc agtgcgccca cggcatcaag cccgtggtga gcacccagct gctgctgaac 780
ggcagcctgg ccaagggcga gatcatcatc cgcagcgaga acctggccaa caacgccaag 840
atcatcatcg tgcagctgaa caagcccgtg aagatcgtgt gcgtgcgccc caacaacaac 900
acccgcaaga gcgtgcgcat cggccccggc cagaccttct acgccaccgg cgagatcatc 960
ggcgacatcc gccaggccta ctgcatcatc aacaagaccg agtggaacag caccctgcag 1020
ggcgtgagca agaagctgga ggagcacttc agcaagaagg ccatcaagtt cgagcccagc 1080
agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140
tgcgacacca gccagctgtt caacagcacc tacagcccca gcttcaacgg caccgagaac 1200
aagctgaacg gcaccatcac catcacctgc cgcatcaagc agatcatcaa catgtggcag 1260
aaggtgggcc gcgccatgta cgcccccccc atcgccggca acctgacctg cgagagcaac 1320
atcaccggcc tgctgctgac ccgcgacggc ggcaagaccg gccccaacga caccgagatc 1380
ttccgccccg gcggcggcga catgcgcgac aactggcgca acgagctgta caagtacaag 1440
gtggtggaga tcaagcccct gggcgtggcc cccaccgagg ccaagcgccg cgtggtggag 1500
cgcgagaagc gcgccgtggg catcggcgcc gtgttcctgg gcttcctggg cgccgccggc 1560
agcaccatgg gcgccgccag catcaccctg accgtgcagg cccgcctgct gctgagcggc 1620
atcgtgcagc agcagaacaa cctgctgcgc gccatcgagg cccagcagca cctgctgcag 1680
ctgaccgtgt ggggcatcaa gcagctgcag acccgcatcc tggccgtgga gcgctacctg 1740
aaggaccagc agctgctggg catctggggc tgcagcggca agctgatctg caccaccgcc 1800
gtgccctgga acagcagctg gagcaaccgc agccacgacg agatctggga caacatgacc 1860
tggatgcagt gggaccgcga gatcaacaac tacaccgaca ccatctaccg cctgctggag 1920
gagagccaga accagcagga gaagaacgag aaggacctgc tggccctgga cagctggcag 1980
aacctgtgga actggttcag catcaccaac tggctgtggt acatcaagat cttcatcatg 2040
atcgtgggcg gcctgatcgg cctgcgcatc atcttcgccg tgctgagcat cgtgaaccgc 2100
gtgcgccagg gctacagccc cctgcccttc cagaccctga cccccaaccc ccgcgagccc 2160
gaccgcctgg gccgcatcga ggaggagggc ggcgagcagg accgcggccg cagcatccgc 2220
ctggtgagcg gcttcctggc cctggcctgg gacgacctgc gcagcctgtg cctgttcagc 2280
taccaccgcc tgcgcgactt catcctgatc gccgcccgcg tgctggagct gctgggccag 2340
cgcggctggg aggccctgaa gtacctgggc agcctggtgc agtactgggg cctggagctg 2400
aagaagagcg ccatcagcct gctggacacc atcgccatcg ccgtggccga gggcaccgac 2460
cgcatcatcg agttcatcca gcgcatctgc cgcgccatcc gcaacatccc ccgccgcatc 2520
cgccagggct tcgaggccgc cctgcag 2547



<210> 10 
<211> 1035 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic a 
gp41 coding region of HIV strain AF110968 



<400> 10 

gccgtgggca tcggcgccgt gttcctgggc ttcctgggcg ccgccggcag caccatgggc 60
gccgccagca tcaccctgac cgtgcaggcc cgcctgctgc tgagcggcat cgtgcagcag 120
cagaacaacc tgctgcgcgc catcgaggcc cagcagcacc tgctgcagct gaccgtgtgg 180
ggcatcaagc agctgcagac ccgcatcctg gccgtggagc gctacctgaa ggaccagcag 240
ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccgccgt gccctggaac 300
agcagctgga gcaaccgcag ccacgacgag atctgggaca acatgacctg gatgcagtgg 360
gaccgcgaga tcaacaacta caccgacacc atctaccgcc tgctggagga gagccagaac 420
cagcaggaga agaacgagaa ggacctgctg gccctggaca gctggcagaa cctgtggaac 480
tggttcagca tcaccaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540
ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600
tacagccccc tgcccttcca gaccctgacc cccaaccccc gcgagcccga ccgcctgggc 660
cgcatcgagg aggagggcgg cgagcaggac cgcggccgca gcatccgcct ggtgagcggc 720
ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780
cgcgacttca tcctgatcgc cgcccgcgtg ctggagctgc tgggccagcg cggctgggag 840
gccctgaagt acctgggcag cctggtgcag tactggggcc tggagctgaa gaagagcgcc 900
atcagcctgc tggacaccat cgccatcgcc gtggccgagg gcaccgaccg catcatcgag 960
ttcatccagc gcatctgccg cgccatccgc aacatccccc gccgcatccg ccagggcttc 1020
gaggccgccc tgcag 1035

<210> 11
<211> 144
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Env 
common region of HIV strain AF110975 



<400> 11 

agcatcatca ccctgccctg ccgcatcaag cagatcatcg acatgtggca gaaggtgggc 60
cgcgccatct acgccccccc catcgagggc aacatcacct gcagcagcag catcaccggc 120
ctgctgctgg cccgcgacgg cggc 144



<210> 12 
<211> 1437 
<212> DNA

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
gp120 coding region of HIV strain AF110975



<400> 12 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgc 1437



<210> 13 
<211> 1950 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: synthetic 
gp140 coding region of HIV strain AF110975



<400> 13 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440
gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500
gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560
agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620
atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680
ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740
agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800
aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860
caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920
ttcaacatca gcaactggct gtggtacatc 1950



<210> 14 

<211> 2493 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
gp160 coding region of HIV strain AF110975



<400> 14 

agcggcctgg gcaacctgtg ggtgaccgtg tacgacggcg tgcccgtgtg gcgcgaggcc 60
agcaccaccc tgttctgcgc cagcgacgcc aaggcctacg agaaggaggt gcacaacgtg 120
tgggccaccc acgcctgcgt gcccaccgac cccaaccccc aggagatcga gctggacaac 180
gtgaccgaga acttcaacat gtggaagaac gacatggtgg accagatgca cgaggacatc 240
atcagcctgt gggaccagag cctgaagccc cgcgtgaagc tgacccccct gtgcgtgacc 300
ctgaagtgca ccaactacag caccaactac agcaacacca tgaacgccac cagctacaac 360
aacaacacca ccgaggagat caagaactgc accttcaaca tgaccaccga gctgcgcgac 420
aagaagcagc aggtgtacgc cctgttctac aagctggaca tcgtgcccct gaacagcaac 480
agcagcgagt accgcctgat caactgcaac accagcgcca tcacccaggc ctgccccaag 540
gtgagcttcg accccatccc catccactac tgcgcccccg ccggctacgc catcctgaag 600
tgcaagaaca acaccagcaa cggcaccggc ccctgccaga acgtgagcac cgtgcagtgc 660
acccacggca tcaagcccgt ggtgagcacc cccctgctgc tgaacggcag cctggccgag 720
ggcggcgaga tcatcatccg cagcaagaac ctgagcaaca acgcctacac catcatcgtg 780
cacctgaacg acagcgtgga gatcgtgtgc acccgcccca acaacaacac ccgcaagggc 840
atccgcatcg gccccggcca gaccttctac gccaccgaga acatcatcgg cgacatccgc 900
caggcccact gcaacatcag cgccggcgag tggaacaagg ccgtgcagcg cgtgagcgcc 960
aagctgcgcg agcacttccc caacaagacc atcgagttcc agcccagcag cggcggcgac 1020
ctggagatca ccacccacag cttcaactgc cgcggcgagt tcttctactg caacaccagc 1080
aagctgttca acagcagcta caacggcacc agctaccgcg gcaccgagag caacagcagc 1140
atcatcaccc tgccctgccg catcaagcag atcatcgaca tgtggcagaa ggtgggccgc 1200
gccatctacg ccccccccat cgagggcaac atcacctgca gcagcagcat caccggcctg 1260
ctgctggccc gcgacggcgg cctggacaac atcaccaccg agatcttccg cccccagggc 1320
ggcgacatga aggacaactg gcgcaacgag ctgtacaagt acaaggtggt ggagatcaag 1380
cccctgggcg tggcccccac cgaggccaag cgccgcgtgg tggagcgcga gaagcgcgcc 1440
gtgggcatcg gcgccgtgat cttcggcttc ctgggcgccg ccggcagcaa catgggcgcc 1500
gccagcatca ccctgaccgc ccaggcccgc cagctgctga gcggcatcgt gcagcagcag 1560
agcaacctgc tgcgcgccat cgaggcccag cagcacatgc tgcagctgac cgtgtggggc 1620
atcaagcagc tgcaggcccg cgtgctggcc atcgagcgct acctgaagga ccagcagctg 1680
ctgggcatct ggggctgcag cggcaagctg atctgcacca ccaccgtgcc ctggaacagc 1740
agctggagca acaagaccca gggcgagatc tgggagaaca tgacctggat gcagtgggac 1800
aaggagatca gcaactacac cggcatcatc taccgcctgc tggaggagag ccagaaccag 1860
caggagcaga acgagaagga cctgctggcc ctggacagcc gcaacaacct gtggagctgg 1920
ttcaacatca gcaactggct gtggtacatc aagatcttca tcatgatcgt gggcggcctg 1980
atcggcctgc gcatcatctt cgccgtgctg agcatcgtga accgcgtgcg ccagggctac 2040
agccccctga gcttccagac cctgaccccc aacccccgcg gcctggaccg cctgggccgc 2100
atcgaggagg agggcggcga gcaggaccgc gaccgcagca tccgcctggt gcagggcttc 2160
ctggccctgg cctgggacga cctgcgcagc ctgtgcctgt tcagctacca ccgcctgcgc 2220
gacctgatcc tggtgaccgc ccgcgtggtg gagctgctgg gccgcagcag cccccgcggc 2280
ctgcagcgcg gctgggaggc cctgaagtac ctgggcagcc tggtgcagta ctggggcctg 2340
gagctgaaga agagcgccac cagcctgctg gacagcatcg ccatcgccgt ggccgagggc 2400
accgaccgca tcatcgaggt gatccagcgc atctaccgcg ccttctgcaa catcccccgc 2460
cgcgtgcgcc agggcttcga ggccgccctg cag 2493



<210> 15 
<211> 2565 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence and gp160 coding region of HIV
strain AF110975 



<400> 15 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60
ttctggatct gcagcggcct gggcaacctg tgggtgaccg tgtacgacgg cgtgcccgtg 120
tggcgcgagg ccagcaccac cctgttctgc gccagcgacg ccaaggccta cgagaaggag 180
gtgcacaacg tgtgggccac ccacgcctgc gtgcccaccg accccaaccc ccaggagatc 240
gagctggaca acgtgaccga gaacttcaac atgtggaaga acgacatggt ggaccagatg 300
cacgaggaca tcatcagcct gtgggaccag agcctgaagc cccgcgtgaa gctgaccccc 360
ctgtgcgtga ccctgaagtg caccaactac agcaccaact acagcaacac catgaacgcc 420
accagctaca acaacaacac caccgaggag atcaagaact gcaccttcaa catgaccacc 480
gagctgcgcg acaagaagca gcaggtgtac gccctgttct acaagctgga catcgtgccc 540
ctgaacagca acagcagcga gtaccgcctg atcaactgca acaccagcgc catcacccag 600
gcctgcccca aggtgagctt cgaccccatc cccatccact actgcgcccc cgccggctac 660
gccatcctga agtgcaagaa caacaccagc aacggcaccg gcccctgcca gaacgtgagc 720
accgtgcagt gcacccacgg catcaagccc gtggtgagca cccccctgct gctgaacggc 780
agcctggccg agggcggcga gatcatcatc cgcagcaaga acctgagcaa caacgcctac 840
accatcatcg tgcacctgaa cgacagcgtg gagatcgtgt gcacccgccc caacaacaac 900
acccgcaagg gcatccgcat cggccccggc cagaccttct acgccaccga gaacatcatc 960
ggcgacatcc gccaggccca ctgcaacatc agcgccggcg agtggaacaa ggccgtgcag 1020
cgcgtgagcg ccaagctgcg cgagcacttc cccaacaaga ccatcgagtt ccagcccagc 1080
agcggcggcg acctggagat caccacccac agcttcaact gccgcggcga gttcttctac 1140
tgcaacacca gcaagctgtt caacagcagc tacaacggca ccagctaccg cggcaccgag 1200
agcaacagca gcatcatcac cctgccctgc cgcatcaagc agatcatcga catgtggcag 1260
aaggtgggcc gcgccatcta cgcccccccc atcgagggca acatcacctg cagcagcagc 1320
atcaccggcc tgctgctggc ccgcgacggc ggcctggaca acatcaccac cgagatcttc 1380
cgcccccagg gcggcgacat gaaggacaac tggcgcaacg agctgtacaa gtacaaggtg 1440
gtggagatca agcccctggg cgtggccccc accgaggcca agcgccgcgt ggtggagcgc 1500
gagaagcgcg ccgtgggcat cggcgccgtg atcttcggct tcctgggcgc cgccggcagc 1560
aacatgggcg ccgccagcat caccctgacc gcccaggccc gccagctgct gagcggcatc 1620
gtgcagcagc agagcaacct gctgcgcgcc atcgaggccc agcagcacat gctgcagctg 1680
accgtgtggg gcatcaagca gctgcaggcc cgcgtgctgg ccatcgagcg ctacctgaag 1740
gaccagcagc tgctgggcat ctggggctgc agcggcaagc tgatctgcac caccaccgtg 1800
ccctggaaca gcagctggag caacaagacc cagggcgaga tctgggagaa catgacctgg 1860
atgcagtggg acaaggagat cagcaactac accggcatca tctaccgcct gctggaggag 1920
agccagaacc agcaggagca gaacgagaag gacctgctgg ccctggacag ccgcaacaac 1980
ctgtggagct ggttcaacat cagcaactgg ctgtggtaca tcaagatctt catcatgatc 2040
gtgggcggcc tgatcggcct gcgcatcatc ttcgccgtgc tgagcatcgt gaaccgcgtg 2100
cgccagggct acagccccct gagcttccag accctgaccc ccaacccccg cggcctggac 2160
cgcctgggcc gcatcgagga ggagggcggc gagcaggacc gcgaccgcag catccgcctg 2220
gtgcagggct tcctggccct ggcctgggac gacctgcgca gcctgtgcct gttcagctac 2280
caccgcctgc gcgacctgat cctggtgacc gcccgcgtgg tggagctgct gggccgcagc 2340
agcccccgcg gcctgcagcg cggctgggag gccctgaagt acctgggcag cctggtgcag 2400
tactggggcc tggagctgaa gaagagcgcc accagcctgc tggacagcat cgccatcgcc 2460
gtggccgagg gcaccgaccg catcatcgag gtgatccagc gcatctaccg cgccttctgc 2520
aacatccccc gccgcgtgcg ccagggcttc gaggccgccc tgcag 2565



<210> 16 

<211> 1056 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic a 
gp41 coding region of HIV strain AF110975 



<400> 16 

gccgtgggca tcggcgccgt gatcttcggc ttcctgggcg ccgccggcag caacatgggc 60
gccgccagca tcaccctgac cgcccaggcc cgccagctgc tgagcggcat cgtgcagcag 120
cagagcaacc tgctgcgcgc catcgaggcc cagcagcaca tgctgcagct gaccgtgtgg 180
ggcatcaagc agctgcaggc ccgcgtgctg gccatcgagc gctacctgaa ggaccagcag 240
ctgctgggca tctggggctg cagcggcaag ctgatctgca ccaccaccgt gccctggaac 300
agcagctgga gcaacaagac ccagggcgag atctgggaga acatgacctg gatgcagtgg 360
gacaaggaga tcagcaacta caccggcatc atctaccgcc tgctggagga gagccagaac 420
cagcaggagc agaacgagaa ggacctgctg gccctggaca gccgcaacaa cctgtggagc 480
tggttcaaca tcagcaactg gctgtggtac atcaagatct tcatcatgat cgtgggcggc 540
ctgatcggcc tgcgcatcat cttcgccgtg ctgagcatcg tgaaccgcgt gcgccagggc 600
tacagccccc tgagcttcca gaccctgacc cccaaccccc gcggcctgga ccgcctgggc 660
cgcatcgagg aggagggcgg cgagcaggac cgcgaccgca gcatccgcct ggtgcagggc 720
ttcctggccc tggcctggga cgacctgcgc agcctgtgcc tgttcagcta ccaccgcctg 780
cgcgacctga tcctggtgac cgcccgcgtg gtggagctgc tgggccgcag cagcccccgc 840
ggcctgcagc gcggctggga ggccctgaag tacctgggca gcctggtgca gtactggggc 900
ctggagctga agaagagcgc caccagcctg ctggacagca tcgccatcgc cgtggccgag 960
ggcaccgacc gcatcatcga ggtgatccag cgcatctacc gcgccttctg caacatcccc 1020
cgccgcgtgc gccagggctt cgaggccgcc ctgcag 1056



<210> 17 
<211> 492 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 17 

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Gly Lys Leu Asp Ala Trp
1 5 10 15

Glu Arg Ile Arg Leu Arg Pro Gly Gly Lys Lys Cys Tyr Met Met Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Glu Lys Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Thr Ser Glu Gly Cys Lys Gln Ile Ile Arg Gln Leu
50 55 60

His Pro Ala Leu Gln Thr Gly Ser Glu Glu Leu Lys Ser Leu Phe Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Glu Lys Ile Glu Val Arg Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Cys Gln
100 105 110

Gln Lys Ile Gln Gln Ala Glu Ala Ala Asp Lys Gly Lys Val Ser Gln
115 120 125

Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Ala
130 135 140

Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys
145 150 155 160

Ala Phe Ser Pro Glu Val Ile Pro Met Phe Thr Ala Leu Ser Glu Gly
165 170 175

Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His
180 185 190

Gln Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala
195 200 205

Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala Pro Gly
210 215 220

Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr Ser Thr
225 230 235 240

Leu Gln Glu Gln Ile Ala Trp Met Thr Ser Asn Pro Pro Ile Pro Val
245 250 255

Gly Asp Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val
260 265 270

Arg Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Lys Gln Gly Pro Lys
275 280 285

Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala
290 295 300

Glu Gln Ser Thr Gln Glu Val Lys Asn Trp Met Thr Asp Thr Leu Leu
305 310 315 320

Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly
325 330 335

Pro Gly Ala Ser Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly
340 345 350

Gly Pro Ser His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Ala
355 360 365

Asn Thr Ser Val Met Met Gln Lys Ser Asn Phe Lys Gly Pro Arg Arg
370 375 380

Ile Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Arg Asn
385 390 395 400

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly
405 410 415

His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly Lys
420 425 430

Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gln Ser Arg
435 440 445

Pro Glu Pro Thr Ala Pro Pro Ala Glu Ser Phe Arg Phe Glu Glu Thr
450 455 460

Thr Pro Gly Gln Lys Gln Glu Ser Lys Asp Arg Glu Thr Leu Thr Ser
465 470 475 480

Leu Lys Ser Leu Phe Gly Asn Asp Pro Leu Ser Gln
485 490



<210> 18 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence of HIV strain AF110968 

<400> 18 

atgcgcgtga tgggcatcct gaagaactac cagcagtggt ggatgtgggg catcctgggc 60
ttctggatgc tgatcatcag c 81



<210> 19 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
signal sequence of HIV strain AF110975 

<400> 19 

atgcgcgtgc gcggcatcct gcgcagctgg cagcagtggt ggatctgggg catcctgggc 60 
ttctggatct gc 72 

<210> 20 
<211> 1479 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
coding sequence of HIV strain AF110965 



<400> 20 

atgggcgccc gcgccagcat cctgcgcggc ggcaagctgg acgcctggga gcgcatccgc 60
ctgcgccccg gcggcaagaa gtgctacatg atgaagcacc tggtgtgggc cagccgcgag 120
ctggagaagt tcgccctgaa ccccggcctg ctggagacca gcgagggctg caagcagatc 180
atccgccagc tgcaccccgc cctgcagacc ggcagcgagg agctgaagag cctgttcaac 240
accgtggcca ccctgtactg cgtgcacgag aagatcgagg tgcgcgacac caaggaggcc 300
ctggacaaga tcgaggagga gcagaacaag tgccagcaga agatccagca ggccgaggcc 360
gccgacaagg gcaaggtgag ccagaactac cccatcgtgc agaacctgca gggccagatg 420
gtgcaccagg ccatcagccc ccgcaccctg aacgcctggg tgaaggtgat cgaggagaag 480
gccttcagcc ccgaggtgat ccccatgttc accgccctga gcgagggcgc caccccccag 540
gacctgaaca ccatgctgaa caccgtgggc ggccaccagg ccgccatgca gatgctgaag 600
gacaccatca acgaggaggc cgccgagtgg gaccgcgtgc accccgtgca cgccggcccc 660
atcgcccccg gccagatgcg cgagccccgc ggcagcgaca tcgccggcac caccagcacc 720
ctgcaggagc agatcgcctg gatgaccagc aaccccccca tccccgtggg cgacatctac 780
aagcgctgga tcatcctggg cctgaacaag atcgtgcgca tgtacagccc cgtgagcatc 840
ctggacatca agcagggccc caaggagccc ttccgcgact acgtggaccg cttcttcaag 900
accctgcgcg ccgagcagag cacccaggag gtgaagaact ggatgaccga caccctgctg 960
gtgcagaacg ccaaccccga ctgcaagacc atcctgcgcg ccctgggccc cggcgccagc 1020
ctggaggaga tgatgaccgc ctgccagggc gtgggcggcc ccagccacaa ggcccgcgtg 1080
ctggccgagg ccatgagcca ggccaacacc agcgtgatga tgcagaagag caacttcaag 1140
ggcccccgcc gcatcgtgaa gtgcttcaac tgcggcaagg agggccacat cgcccgcaac 1200
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320
cccggcaact tcctgcagag ccgccccgag cccaccgccc cccccgccga gagcttccgc 1380
ttcgaggaga ccacccccgg ccagaagcag gagagcaagg accgcgagac cctgaccagc 1440
ctgaagagcc tgttcggcaa cgaccccctg agccagtaa 1479



<210> 21 
<211> 1509 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic Gag 
coding sequence of HIV strain AF110967 



<400> 21 

atgggcgccc gcgccagcat cctgcgcggc gagaagctgg acaagtggga gaagatccgc 60
ctgcgccccg gcggcaagaa gcactacatg ctgaagcacc tggtgtgggc cagccgcgag 120
ctggagggct tcgccctgaa ccccggcctg ctggagaccg ccgagggctg caagcagatc 180
atgaagcagc tgcagcccgc cctgcagacc ggcaccgagg agctgcgcag cctgtacaac 240
accgtggcca ccctgtactg cgtgcacgcc ggcatcgagg tgcgcgacac caaggaggcc 300
ctggacaaga tcgaggagga gcagaacaag agccagcaga agacccagca ggccaaggag 360
gccgacggca aggtgagcca gaactacccc atcgtgcaga acctgcaggg ccagatggtg 420
caccaggcca tcagcccccg caccctgaac gcctgggtga aggtgatcga ggagaaggcc 480
ttcagccccg aggtgatccc catgttcacc gccctgagcg agggcgccac cccccaggac 540
ctgaacacca tgctgaacac cgtgggcggc caccaggccg ccatgcagat gctgaaggac 600
accatcaacg aggaggccgc cgagtgggac cgcctgcacc ccgtgcaggc cggccccgtg 660
gcccccggcc agatgcgcga cccccgcggc agcgacatcg ccggcgccac cagcaccctg 720
caggagcaga tcgcctggat gaccagcaac ccccccgtgc ccgtgggcga catctacaag 780
cgctggatca tcctgggcct gaacaagatc gtgcgcatgt acagccccgt gagcatcctg 840
gacatccgcc agggccccaa ggagcccttc cgcgactacg tggaccgctt cttcaagacc 900
ctgcgcgccg agcaggccac ccaggacgtg aagaactgga tgaccgagac cctgctggtg 960
cagaacgcca accccgactg caagaccatc ctgcgcgccc tgggccccgg cgccaccctg 1020
gaggagatga tgaccgcctg ccagggcgtg ggcggccccg gccacaaggc ccgcgtgctg 1080
gccgaggcca tgagccaggc caacagcgtg aacatcatga tgcagaagag caacttcaag 1140
ggcccccgcc gcaacgtgaa gtgcttcaac tgcggcaagg agggccacat cgccaagaac 1200
tgccgcgccc cccgcaagaa gggctgctgg aagtgcggca aggagggcca ccagatgaag 1260
gactgcaccg agcgccaggc caacttcctg ggcaagatct ggcccagcca caagggccgc 1320
cccggcaact tcctgcagaa ccgcagcgag cccgccgccc ccaccgtgcc caccgccccc 1380
cccgccgaga gcttccgctt cgaggagacc acccccgccc ccaagcagga gcccaaggac 1440
cgcgagccct accgcgagcc cctgaccgcc ctgcgcagcc tgttcggcag cggccccctg 1500
agccagtaa 1509



<210> 22 
<211> 502 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 22 

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Glu Lys Leu Asp Lys Trp
1 5 10 15

Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys His Tyr Met Leu Lys
20 25 30

His Leu Val Trp Ala Ser Arg Glu Leu Glu Gly Phe Ala Leu Asn Pro
35 40 45

Gly Leu Leu Glu Thr Ala Glu Gly Cys Lys Gln Ile Met Lys Gln Leu
50 55 60

Gln Pro Ala Leu Gln Thr Gly Thr Glu Glu Leu Arg Ser Leu Tyr Asn
65 70 75 80

Thr Val Ala Thr Leu Tyr Cys Val His Ala Gly Ile Glu Val Arg Asp
85 90 95

Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Gln
100 105 110

Gln Lys Thr Gln Gln Ala Lys Glu Ala Asp Gly Lys Val Ser Gln Asn
115 120 125

Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Ala Ile
130 135 140

Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys Ala
145 150 155 160

Phe Ser Pro Glu Val Ile Pro Met Phe Thr Ala Leu Ser Glu Gly Ala
165 170 175

Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gln
180 185 190

Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala Glu
195 200 205

Trp Asp Arg Leu His Pro Val Gln Ala Gly Pro Val Ala Pro Gly Gln
210 215 220

Met Arg Asp Pro Arg Gly Ser Asp Ile Ala Gly Ala Thr Ser Thr Leu
225 230 235 240

Gln Glu Gln Ile Ala Trp Met Thr Ser Asn Pro Pro Val Pro Val Gly
245 250 255

Asp Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val Arg
260 265 270

Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Arg Gln Gly Pro Lys Glu
275 280 285

Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala Glu
290 295 300

Gln Ala Thr Gln Asp Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val
305 310 315 320

Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly Pro
325 330 335

Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly Gly
340 345 350

Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Ala Asn
355 360 365

Ser Val Asn Ile Met Met Gln Lys Ser Asn Phe Lys Gly Pro Arg Arg
370 375 380

Asn Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Lys Asn
385 390 395 400

Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly
405 410 415

His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly Lys
420 425 430

Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gln Asn Arg
435 440 445

Ser Glu Pro Ala Ala Pro Thr Val Pro Thr Ala Pro Pro Ala Glu Ser
450 455 460

Phe Arg Phe Glu Glu Thr Thr Pro Ala Pro Lys Gln Glu Pro Lys Asp
465 470 475 480

Arg Glu Pro Tyr Arg Glu Pro Leu Thr Ala Leu Arg Ser Leu Phe Gly
485 490 495

Ser Gly Pro Leu Ser Gln
500



<210> 23 
<211> 849 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 23 

Met Arg Val Met Gly Ile Leu Lys Asn Tyr Gln Gln Trp Trp Met Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Met Leu Ile Ile Ser Ser Val Val Gly Asn
20 25 30

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys
35 40 45

Thr Thr Leu Phe Cys Thr Ser Asp Ala Lys Ala Tyr Glu Thr Glu Val
50 55 60

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65 70 75 80

Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
85 90 95

Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
100 105 110

Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
115 120 125

Lys Cys Arg Asn Val Asn Ala Thr Asn Asn Ile Asn Ser Met Ile Asp
130 135 140

Asn Ser Asn Lys Gly Glu Met Lys Asn Cys Ser Phe Asn Val Thr Thr
145 150 155 160

Glu Leu Arg Asp Arg Lys Gln Glu Val His Ala Leu Phe Tyr Arg Leu
165 170 175

Asp Val Val Pro Leu Gln Gly Asn Asn Ser Asn Glu Tyr Arg Leu Ile
180 185 190

Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205

Asp Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Tyr Ala Ile Leu
210 215 220

Lys Cys Asn Asn Gln Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val
225 230 235 240

Ser Ser Val Gln Cys Ala His Gly Ile Lys Pro Val Val Ser Thr Gln
245 250 255

Leu Leu Leu Asn Gly Ser Leu Ala Lys Gly Glu Ile Ile Ile Arg Ser
260 265 270

Glu Asn Leu Ala Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Lys
275 280 285

Pro Val Lys Ile Val Cys Val Arg Pro Asn Asn Asn Thr Arg Lys Ser
290 295 300

Val Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Gly Glu Ile Ile
305 310 315 320

Gly Asp Ile Arg Gln Ala Tyr Cys Ile Ile Asn Lys Thr Glu Trp Asn
325 330 335

Ser Thr Leu Gln Gly Val Ser Lys Lys Leu Glu Glu His Phe Ser Lys
340 345 350

Lys Ala Ile Lys Phe Glu Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr
355 360 365

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asp Thr Ser
370 375 380

Gln Leu Phe Asn Ser Thr Tyr Ser Pro Ser Phe Asn Gly Thr Glu Asn
385 390 395 400

Lys Leu Asn Gly Thr Ile Thr Ile Thr Cys Arg Ile Lys Gln Ile Ile
405 410 415

Asn Met Trp Gln Lys Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala
420 425 430

Gly Asn Leu Thr Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg
435 440 445

Asp Gly Gly Lys Thr Gly Pro Asn Asp Thr Glu Ile Phe Arg Pro Gly
450 455 460

Gly Gly Asp Met Arg Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys
465 470 475 480

Val Val Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg
485 490 495

Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe
500 505 510

Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile
515 520 525

Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln
530 535 540

Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln
545 550 555 560

Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Thr Arg Ile Leu Ala Val
565 570 575

Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser
580 585 590

Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Ser Trp Ser
595 600 605

Asn Arg Ser His Asp Glu Ile Trp Asp Asn Met Thr Trp Met Gln Trp
610 615 620

Asp Arg Glu Ile Asn Asn Tyr Thr Asp Thr Ile Tyr Arg Leu Leu Glu
625 630 635 640

Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Lys Asp Leu Leu Ala Leu
645 650 655

Asp Ser Trp Gln Asn Leu Trp Asn Trp Phe Ser Ile Thr Asn Trp Leu
660 665 670

Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu
675 680 685

Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly
690 695 700

Tyr Ser Pro Leu Pro Phe Gln Thr Leu Thr Pro Asn Pro Arg Glu Pro
705 710 715 720

Asp Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Gly
725 730 735

Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp
740 745 750

Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ile
755 760 765

Leu Ile Ala Ala Arg Val Leu Glu Leu Leu Gly Gln Arg Gly Trp Glu
770 775 780

Ala Leu Lys Tyr Leu Gly Ser Leu Val Gln Tyr Trp Gly Leu Glu Leu
785 790 795 800

Lys Lys Ser Ala Ile Ser Leu Leu Asp Thr Ile Ala Ile Ala Val Ala
805 810 815

Glu Gly Thr Asp Arg Ile Ile Glu Phe Ile Gln Arg Ile Cys Arg Ala
820 825 830

Ile Arg Asn Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu Ala Ala Leu
835 840 845

Gln



<210> 24 
<211> 855 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 24 

Met Arg Val Arg Gly Ile Leu Arg Ser Trp Gln Gln Trp Trp Ile Trp
1 5 10 15

Gly Ile Leu Gly Phe Trp Ile Cys Ser Gly Leu Gly Asn Leu Trp Val
20 25 30

Thr Val Tyr Asp Gly Val Pro Val Trp Arg Glu Ala Ser Thr Thr Leu
35 40 45

Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val
50 55 60

Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Ile
65 70 75 80

Glu Leu Asp Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met
85 90 95

Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
100 105 110

Lys Pro Arg Val Lys Leu Thr Pro Leu Cys Val Thr Leu Lys Cys Thr
115 120 125

Asn Tyr Ser Thr Asn Tyr Ser Asn Thr Met Asn Ala Thr Ser Tyr Asn
130 135 140

Asn Asn Thr Thr Glu Glu Ile Lys Asn Cys Thr Phe Asn Met Thr Thr
145 150 155 160

Glu Leu Arg Asp Lys Lys Gln Gln Val Tyr Ala Leu Phe Tyr Lys Leu
165 170 175

Asp Ile Val Pro Leu Asn Ser Asn Ser Ser Glu Tyr Arg Leu Ile Asn
180 185 190

Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp
195 200 205

Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys
210 215 220

Cys Lys Asn Asn Thr Ser Asn Gly Thr Gly Pro Cys Gln Asn Val Ser
225 230 235 240

Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Pro Leu
245 250 255

Leu Leu Asn Gly Ser Leu Ala Glu Gly Gly Glu Ile Ile Ile Arg Ser
260 265 270

Lys Asn Leu Ser Asn Asn Ala Tyr Thr Ile Ile Val His Leu Asn Asp
275 280 285

Ser Val Glu Ile Val Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Gly
290 295 300

Ile Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Glu Asn Ile Ile
305 310 315 320

Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Ala Gly Glu Trp Asn
325 330 335

Lys Ala Val Gln Arg Val Ser Ala Lys Leu Arg Glu His Phe Pro Asn
340 345 350

Lys Thr Ile Glu Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr
355 360 365

Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser
370 375 380

Lys Leu Phe Asn Ser Ser Tyr Asn Gly Thr Ser Tyr Arg Gly Thr Glu
385 390 395 400

Ser Asn Ser Ser Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile
405 410 415

Asp Met Trp Gln Lys Val Gly Arg Ala Ile Tyr Ala Pro Pro Ile Glu
420 425 430

Gly Asn Ile Thr Cys Ser Ser Ser Ile Thr Gly Leu Leu Leu Ala Arg
435 440 445

Asp Gly Gly Leu Asp Asn Ile Thr Thr Glu Ile Phe Arg Pro Gln Gly
450 455 460

Gly Asp Met Lys Asp Asn Trp Arg Asn Glu Leu Tyr Lys Tyr Lys Val
465 470 475 480

Val Glu Ile Lys Pro Leu Gly Val Ala Pro Thr Glu Ala Lys Arg Arg
485 490 495

Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Ile Phe
500 505 510

Gly Phe Leu Gly Ala Ala Gly Ser Asn Met Gly Ala Ala Ser Ile Thr
515 520 525

Leu Thr Ala Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln
530 535 540

Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Gln Leu
545 550 555 560

Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Ile Glu
565 570 575

Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly
580 585 590

Lys Leu Ile Cys Thr Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn
595 600 605

Lys Thr Gln Gly Glu Ile Trp Glu Asn Met Thr Trp Met Gln Trp Asp
610 615 620

Lys Glu Ile Ser Asn Tyr Thr Gly Ile Ile Tyr Arg Leu Leu Glu Glu
625 630 635 640

Ser Gln Asn Gln Gln Glu Gln Asn Glu Lys Asp Leu Leu Ala Leu Asp
645 650 655

Ser Arg Asn Asn Leu Trp Ser Trp Phe Asn Ile Ser Asn Trp Leu Trp
660 665 670

Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg
675 680 685

Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr
690 695 700

Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Asn Pro Arg Gly Leu Asp
705 710 715 720

Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu Gln Asp Arg Asp Arg
725 730 735

Ser Ile Arg Leu Val Gln Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu
740 745 750

Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Ile Leu
755 760 765

Val Thr Ala Arg Val Val Glu Leu Leu Gly Arg Ser Ser Pro Arg Gly
770 775 780

Leu Gln Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly Ser Leu Val Gln
785 790 795 800

Tyr Trp Gly Leu Glu Leu Lys Lys Ser Ala Thr Ser Leu Leu Asp Ser
805 810 815

Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Val Ile
820 825 830

Gln Arg Ile Tyr Arg Ala Phe Cys Asn Ile Pro Arg Arg Val Arg Gln
835 840 845

Gly Phe Glu Ala Ala Leu Gln
850 855



<210> 25 
<211> 20 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 25 

Asp Ile Lys Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg
1 5 10 15

Phe Phe Lys Thr
20



<210> 26 
<211> 60 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 26 

gacataaaac aaggaccaaa agagcccttt agagactatg tagaccggtt ctttaaaacc 60 

<210> 27 
<211> 20 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 27 

Asp Ile Arg Gln Gly Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg
1 5 10 15

Phe Phe Lys Thr
20



<210> 28 
<211> 47 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 28 

Thr Ile Thr Ile Thr Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln
1 5 10 15

Lys Val Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Leu Thr
20 25 30

Cys Glu Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly
35 40 45
35 40 45 



<210> 29 
<211> 48 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Ser Ile Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asp Met Trp
1 5 10 15

Gln Lys Val Gly Arg Ala Ile Tyr Ala Pro Pro Ile Glu Gly Asn Ile
20 25 30

Thr Cys Ser Ser Ser Ile Thr Gly Leu Leu Leu Ala Arg Asp Gly Gly
35 40 45
35 40 45 



<210> 30 
<211> 2469 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PR975(+) 



<400> 30 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60
cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120
cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180
ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240
ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300
agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360
ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420
ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480
atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600
ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660
accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720
gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780
atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840
aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900
ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960
cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020
agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080
aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200
gagatcgtga tctaccagta catggacgac ctgtacgtgg gcagcgacct ggagatcggc 1260
cagcaccgcg ccaagatcga ggagctgcgc aagcacctgc tgcgctgggg cttcaccacc 1320
cccgacaaga agcaccagaa ggagcccccc ttcctgtgga tgggctacga gctgcacccc 1380
gacaagtgga ccgtgcagcc catcgagctg cccgagaagg agagctggac cgtgaacgac 1440
atccagaagc tggtgggcaa gctgaactgg gccagccaga tctaccccgg catcaaggtg 1500
cgccagctgt gcaagctgct gcgcggcgcc aaggccctga ccgacatcgt gcccctgacc 1560
gaggaggccg agctggagct ggccgagaac cgcgagatcc tgcgcgagcc cgtgcacggc 1620
gtgtactacg accccagcaa ggacctggtg gccgagatcc agaagcaggg ccacgaccag 1680
tggacctacc agatctacca ggagcccttc aagaacctga agaccggcaa gtacgccaag 1740
atgcgcaccg cccacaccaa cgacgtgaag cagctgaccg aggccgtgca gaagatcgcc 1800
atggagagca tcgtgatctg gggcaagacc cccaagttcc gcctgcccat ccagaaggag 1860
acctgggaga cctggtggac cgactactgg caggccacct ggatccccga gtgggagttc 1920
gtgaacaccc cccccctggt gaagctgtgg taccagctgg agaaggagcc catcatcggc 1980
gccgagacct tctacgtgga cggcgccgcc aaccgcgaga ccaagatcgg caaggccggc 2040
tacgtgaccg accggggccg gcagaagatc gtgagcctga ccgagaccac caaccagaag 2100
accgagctgc aggccatcca gctggccctg caggacagcg gcagcgaggt gaacatcgtg 2160
accgacagcc agtacgccct gggcatcatc caggcccagc ccgacaagag cgagagcgag 2220
ctggtgaacc agatcatcga gcagctgatc aagaaggaga aggtgtacct gagctgggtg 2280
cccgcccaca agggcatcgg cggcaacgag cagatcgaca agctggtgag caagggcatc 2340
cgcaaggtgc tgttcctgga cggcatcgat ggcggcatcg tgatctacca gtacatggac 2400
gacctgtacg tgggcagcgg cggccctagg atcgattaaa agcttcccgg ggctagcacc 2460
ggtgaattc 2469



<210> 31 
<211> 2463 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PR975YM 



<400> 31 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60
cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120
cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180
ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240
ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300
agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360
ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420
ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480
atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600
ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660
accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720
gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780
atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840
aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900
ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960
cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020
agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080
aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200
gagatcgtga tctaccaggc ccccctgtac gtgggcagcg acctggagat cggccagcac 1260
cgcgccaaga tcgaggagct gcgcaagcac ctgctgcgct ggggcttcac cacccccgac 1320
aagaagcacc agaaggagcc ccccttcctg tggatgggct acgagctgca ccccgacaag 1380
tggaccgtgc agcccatcga gctgcccgag aaggagagct ggaccgtgaa cgacatccag 1440
aagctggtgg gcaagctgaa ctgggccagc cagatctacc ccggcatcaa ggtgcgccag 1500
ctgtgcaagc tgctgcgcgg cgccaaggcc ctgaccgaca tcgtgcccct gaccgaggag 1560
gccgagctgg agctggccga gaaccgcgag atcctgcgcg agcccgtgca cggcgtgtac 1620
tacgacccca gcaaggacct ggtggccgag atccagaagc agggccacga ccagtggacc 1680
taccagatct accaggagcc cttcaagaac ctgaagaccg gcaagtacgc caagatgcgc 1740
accgcccaca ccaacgacgt gaagcagctg accgaggccg tgcagaagat cgccatggag 1800
agcatcgtga tctggggcaa gacccccaag ttccgcctgc ccatccagaa ggagacctgg 1860
gagacctggt ggaccgacta ctggcaggcc acctggatcc ccgagtggga gttcgtgaac 1920
accccccccc tggtgaagct gtggtaccag ctggagaagg agcccatcat cggcgccgag 1980
accttctacg tggacggcgc cgccaaccgc gagaccaaga tcggcaaggc cggctacgtg 2040
accgaccggg gccggcagaa gatcgtgagc ctgaccgaga ccaccaacca gaagaccgag 2100
ctgcaggcca tccagctggc cctgcaggac agcggcagcg aggtgaacat cgtgaccgac 2160
agccagtacg ccctgggcat catccaggcc cagcccgaca agagcgagag cgagctggtg 2220
aaccagatca tcgagcagct gatcaagaag gagaaggtgt acctgagctg ggtgcccgcc 2280
cacaagggca tcggcggcaa cgagcagatc gacaagctgg tgagcaaggg catccgcaag 2340
gtgctgttcc tggacggcat cgatggcggc atcgtgatct accagtacat ggacgacctg 2400
tacgtgggca gcggcggccc taggatcgat taaaagcttc ccggggctag caccggtgaa 2460
ttc 2463



<210> 32 
<211> 2457 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PR975YMWM 



<400> 32 

gtcgacgcca ccatggccga ggccatgagc caggccacca gcgccaacat cctgatgcag 60
cgcagcaact tcaagggccc caagcgcatc atcaagtgct tcaactgcgg caaggagggc 120
cacatcgccc gcaactgccg cgccccccgc aagaagggct gctggaagtg cggcaaggag 180
ggccaccaga tgaaggactg caccgagcgc caggccaact tcttccgcga ggacctggcc 240
ttcccccagg gcaaggcccg cgagttcccc agcgagcaga accgcgccaa cagccccacc 300
agccgcgagc tgcaggtgcg cggcgacaac ccccgcagcg aggccggcgc cgagcgccag 360
ggcaccctga acttccccca gatcaccctg tggcagcgcc ccctggtgag catcaaggtg 420
ggcggccaga tcaaggaggc cctgctggac accggcgccg acgacaccgt gctggaggag 480
atgagcctgc ccggcaagtg gaagcccaag atgatcggcg gcatcggcgg cttcatcaag 540
gtgcgccagt acgaccagat cctgatcgag atctgcggca agaaggccat cggcaccgtg 600
ctgatcggcc ccacccccgt gaacatcatc ggccgcaaca tgctgaccca gctgggctgc 660
accctgaact tccccatcag ccccatcgag accgtgcccg tgaagctgaa gcccggcatg 720
gacggcccca aggtgaagca gtggcccctg accgaggaga agatcaaggc cctgaccgcc 780
atctgcgagg agatggagaa ggagggcaag atcaccaaga tcggccccga gaacccctac 840
aacacccccg tgttcgccat caagaagaag gacagcacca agtggcgcaa gctggtggac 900
ttccgcgagc tgaacaagcg cacccaggac ttctgggagg tgcagctggg catcccccac 960
cccgccggcc tgaagaagaa gaagagcgtg accgtgctgg acgtgggcga cgcctacttc 1020
agcgtgcccc tggacgagga cttccgcaag tacaccgcct tcaccatccc cagcatcaac 1080
aacgagaccc ccggcatccg ctaccagtac aacgtgctgc cccagggctg gaagggcagc 1140
cccagcatct tccagagcag catgaccaag atcctggagc ccttccgcgc ccgcaacccc 1200
gagatcgtga tctaccaggc ccccctgtac gtgggcagcg acctggagat cggccagcac 1260
cgcgccaaga tcgaggagct gcgcaagcac ctgctgcgct ggggcttcac cacccccgac 1320
aagaagcacc agaaggagcc ccccttcctg cccatcgagc tgcaccccga caagtggacc 1380
gtgcagccca tcgagctgcc cgagaaggag agctggaccg tgaacgacat ccagaagctg 1440
gtgggcaagc tgaactgggc cagccagatc taccccggca tcaaggtgcg ccagctgtgc 1500
aagctgctgc gcggcgccaa ggccctgacc gacatcgtgc ccctgaccga ggaggccgag 1560
ctggagctgg ccgagaaccg cgagatcctg cgcgagcccg tgcacggcgt gtactacgac 1620
cccagcaagg acctggtggc cgagatccag aagcagggcc acgaccagtg gacctaccag 1680
atctaccagg agcccttcaa gaacctgaag accggcaagt acgccaagat gcgcaccgcc 1740
cacaccaacg acgtgaagca gctgaccgag gccgtgcaga agatcgccat ggagagcatc 1800
gtgatctggg gcaagacccc caagttccgc ctgcccatcc agaaggagac ctgggagacc 1860
tggtggaccg actactggca ggccacctgg atccccgagt gggagttcgt gaacaccccc 1920
cccctggtga agctgtggta ccagctggag aaggagccca tcatcggcgc cgagaccttc 1980
tacgtggacg gcgccgccaa ccgcgagacc aagatcggca aggccggcta cgtgaccgac 2040
cggggccggc agaagatcgt gagcctgacc gagaccacca accagaagac cgagctgcag 2100
gccatccagc tggccctgca ggacagcggc agcgaggtga acatcgtgac cgacagccag 2160
tacgccctgg gcatcatcca ggcccagccc gacaagagcg agagcgagct ggtgaaccag 2220
atcatcgagc agctgatcaa gaaggagaag gtgtacctga gctgggtgcc cgcccacaag 2280
ggcatcggcg gcaacgagca gatcgacaag ctggtgagca agggcatccg caaggtgctg 2340
ttcctggacg gcatcgatgg cggcatcgtg atctaccagt acatggacga cctgtacgtg 2400
ggcagcggcg gccctaggat cgattaaaag cttcccgggg ctagcaccgg tgaattc 2457



<210> 33 
<211> 9781 
<212> DNA 

<213> Human immunodeficiency virus 



<400> 33 

tggaagggtt 

cacaaggctt 

tgacctttgg 

acggaggaga 

atagagaagt 

agctacatcc 

tccactgggg 

atgctgcata 

gagcctggga 

cttgagtgct 

tcagaccctt 

agtgaaagtg 

ggcaagaggc 

gaaggagaga 

aaagaattag 

caagcaggga 

gtaaacaaat 

cattattcaa 

ccaaggaagc 

aggcaaaagc 

ggcaaatggt 

aggaaaaggc 

ccccacaaga 

tgttaaaaga 

cagggcctgt 

ctagtaccct 

acatctataa 

ttagcatttt 

tctttaaaac 

ccttgttggt 

gggcctcatt 

caagagtgtt 

attttaaagg 

ccagaaattg 

aaatgaaaga 



aatttactcc 
cttccctgat 
atggtgctac 
agacaactgt 
attaaagtgg 
ggagtattac 
cgttccggga 
taagcagctg 
gccctctggc 
ttaagtagtg 
tgtggtagtg 
agaccagagg 
gagaggggcg 
gatgggtgcg 
gttaaggcca 
gctggaaaga 
aataaaacag 
cacagtagca 
cttagacaag 
agctgacgaa 
acaccaagct 
tttcaatcca 
tttaaacaca 
taccatcaat 
tgcaccaggc 
tcaggaacaa 
aagatggata 
ggacataaaa 
cttaagagct 
ccaaaatgcg 
agaagaaatg 
ggctgaggca 
ctctaacaga 
cagggcccct 
ctgtactgag 



aagaaaaggc 
tggcaaaact 
aagctagtgc 
ttgctacacc 
aagtttgaca 
aaagactgct 
ggtgtggtct 
cttttcgcct 
tatctaggga 
tgtgcccatc 
tggaaaatct 
agatctctcg 
gctggtgagt 
agagcgtcaa 
gggggaaaga 
tttgcactta 
ctacaaccag 
actctctatt 
atagaggaag 
aaggtcagtc 
atatcaccta 
gaggaaatac 
atgttaaata 
gaggaggctg 
cagatgagag 
atagcatgga 
attctggggt 
caagggccaa 
gaacaagcta 
aacccagatt 
atgacagcat 
atgagccaag 
attattaaat 
aggaaaaagg 
aggcaggcta 



aagaaatcct 
acacaccggg 
cagttgaccc 
ctatgagcca 
gcctcctagc 
gacacagaag 
gggcgggact 
gtactgggtc 
acccactgct 
tgttgtgtga 
ctagcagtgg 
acgcaggact 
acgccaattt 
tattaagcgg 
aacattatat 
accctggcct 
ctcttcagac 
gtgtacataa 
aacaaaacaa 
aaaattatcc 
gaacattgaa 
ccatgtttac 
cagtgggggg 
cagaatggga 
aaccaagggg 
tgacaagtaa 
taaataaaat 
aagaaccctt 
cacaagatgt 
gtaagaccat 
gtcagggagt 
caaacagtaa 
gtttcaactg 
gctgttggaa 
attttttagg 



tgatttgtgg 
gccaggggtc 
aggggaggtg 
acatggagca 
acgcagacac 
ggactttccg 
tgggagtggt 
tctctcggta 
taagcctcaa 
ctctggtaac 
cgcccgaaca 
cggcttgctg 
tacttgacta 
cggaaaatta 
gttaaaacat 
gttagaaaca 
aggaacagag 
agggatagag 
atgtcagcaa 
tatagtacag 
tgcatggata 
agcattatca 
acatcaagca 
taggacacat 
aagtgacata 
tccacctatt 
agtaagaatg 
tagagactat 
aaagaattgg 
tttaagagca 
gggaggacct 
catactagtg 
tggcaaagta 
atgtggacag 
gaaaatttgg 



gtctatcaca 
agatatccac 
gaagaggcca 
gaggatgaag 
atggcccgcg 
cctgggactt 
caaccctcag 
gaccagatct 
taaagcttgc 
tagagatccc 
gggaccagaa 
aagtgcacac 
gcggaggcta 
gataaatggg 
ctagtatggg 
tcagaaggct 
gaacttagat 
gtacgagaca 
aaagcacaac 
aatgcccaag 
aaagtaatag 
gaaggagcca 
gccatgcaaa 
ccagtacatg 
gcaggaacta 
ccagtagaag 
tatagccctg 
gtagaccggt 
atgacagaca 
ttaggaccag 
agccataaag 
cagagaagca 
gggcacatag 
gaaggacacc 
ccttcccaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
aggggaggcc 
caacagcccc 
agaaagagag 
aataaaagta 
attagaagaa 
ttttatcaaa 
aggtacagta 
gcttggatgc 
accaggaatg 
attaacagca 
taatccatat 
attagtagat 
aataccacac 
tgcatatttt 
tagtataaac 
gaaaggatca 
aaaaaatcca 
agaaataggg 
atttacaaca 
actccatcct 
tgtcaatgat 
gattaaagta 
accactaact 
agtacatgga 
gcatgaacaa 
gtatgcaaaa 
aaaaatagcc 
ccaaaaagaa 
gtgggagttt 
catagcagga 
aaaagcaggg 
aaatcagaag 
aaacatagta 
tgactcagag 
gtcatgggta 
taagggaatt 
aaggtaccac 
aaaagaaata 
agtcgactgt 
cctggtagca 
aggacaagaa 
acatacagac 
aggtatccaa 
catgaataaa 
gacagcagta 
gtacagtgca 
acaaaaacaa 
tatttggaaa 
taaaggtgac 
acagatggca 
gtttagtaaa 
attttgaaag 
gattagtaat 
atggagtctc 
cagaccagct 
ccatattagg 
gatctctgca 



agggaatttc 
accagcagag 
ggaaccttta 
gagggccaga 
atagatttgc 
gtaagacagt 
ttagtagggc 
acactaaatt 
gatggcccaa 
atttgtgagg 
aacactccag 
ttcagggaac 
ccagcaggat 
tcagttcctt 
aatgaaacac 
ccagcaatat 
gacatagtta 
caacatagag 
ccagacaaga 
gacaaatgga 
atacagaagt 
aggcaactct 
gaagaagcag 
gtatattatg 
tggacatatc 
atgaggacta 
atggaaagca 
acatgggaga 
gttaataccc 
gtagaaactt 
tatgttactg 
actgagttac 
acagactcac 
atatttaacc 
ccagcacata 
aggaaagtgt 
agcaattgga 
gtagctagct 
agtccaggga 
gtccatgtag 
acagcatatt 
aatggcagta 
caggaatttg 
gaattaaaga 
caaatggcag 
ggggaaagaa 
attataagaa 
ggaccagccg 
ataaaggtag 
ggtgctgatt 
gcaccatatg 
cagacatcca 
aaaaacatat 
catagaatgg 
aattcacatg 
acacatagtt 
atacttggca 



ctccagaaca 
agcttcaggt 
acttccctca 
taaaggaggc 
cagggaaatg 
atgatcaaat 
ctacaccagt 
ttccaattag 
aggtcaaaca 
aaatggagaa 
tatttgccat 
tcaataaaag 
taaaaaagaa 
tagatgaaag 
cagggattag 
tccagagtag 
tctatcaata 
caaaaataga 
aacatcaaaa 
cagtacaacc 
tagtgggaaa 
gtaaactcct 
aattagaatt 
atccatcaaa 
aaatttatca 
cccacactaa 
tagtaatatg 
catggtggac 
ctcccctagt 
tctatgtaga 
acagaggaag 
aagcaattca 
agtatgcatt 
aaataataga 
aaggaattgg 
tgtttctaga 
gagcaatggc 
gtgataaatg 
tatggcaatt 
ctagtggcta 
ttatattaaa 
attttaccag 
gaattcccta 
aaataatagg 
tattcattca 
taatagacat 
ttcaaaattt 
aactactctg 
taccaaggag 
gtgtggcagg 
tatatatcaa 
aaagtaagtt 
tggggtttgc 
agactgagag 
cattattttg 
tttcctaggt 
ctgacagcat 



gaccagagcc 
tcgaggagac 
aatcactctt 
tctcttagac 
gaaaccaaaa 
acttatagaa 
caacataatt 
tcctattgaa 
atggccattg 
ggaaggaaaa 
aaaaaagaag 
aactcaagac 
aaaatcagtg 
cttcaggaaa 
atatcaatat 
catgacaaaa 
tatggatgac 
agagttaagg 
agaaccccca 
tatactgctg 
attaaactgg 
caggggggcc 
ggcagagaac 
agacttgata 
agaaccattt 
tgatgtaaaa 
gggaaagact 
agactattgg 
aaaattatgg 
tggagcaact 
gcagaaaatt 
gctagctctg 
aggaatcatt 
acagttaata 
gggaaatgaa 
tggaatagat 
taatgagttt 
tcagctaaaa 
agattgtacc 
catggaagca 
attagcagga 
tactgcagtt 
caatccccaa 
acaagtaaga 
caattttaaa 
aatagcaaca 
tcgggtttat 
gaaaggtgaa 
gaaagcaaaa 
tggacaggat 
ggagagctag 
cagaagtaca 
agacaggaga 
aatacagcac 
attgttttac 
gtgactatca 
tgataaaacc 



aacagcccca 
aacccccgtg 
tggcagcgac 
acaggagcag 
atgatagggg 
atttgtggaa 
ggaagaaatc 
actgtaccag 
acagaagaaa 
attacaaaaa 
gacagtacta 
ttttgggaag 
acagtgctag 
tatactgcat 
aatgtgctgc 
atcttagagc 
ttgtatgtag 
gaacatttat 
tttctttgga 
ccagaaaagg 
gcaagtcaga 
aaagcactaa 
agggaaattt 
gctgaaatac 
aaaaatctga 
cagttaacag 
cctaaattta 
caagccacct 
taccaactag 
aatagggaag 
gttactctaa 
caggattcag 
caagcacaac 
aacaaggaaa 
caagtagata 
aaagctcaag 
aatctgccac 
ggggaagcca 
catttagagg 
gaggttatcc 
agatggccag 
aaggcagcct 
agtcagggag 
gatcaagctg 
agaaaagggg 
gacatacaaa 
tacagagaca 
ggggtagtag 
atcattagag 
gaagattaga 
tggatgggtc 
tatcccatta 
aagagattgg 
acaagtagac 
agaatctgcc 
agcaggacat 
aaaaaagaga 



ccagcagaac 
ccgaggaagg 
cccttgtctc 
atgatacagt 
gaattggagg 
aaaaggctat 
tgttaactca 
taaaattaaa 
aaataaaagc 
ttgggcctga 
agtggagaaa 
ttcaattagg 
atgtggggga 
tcaccatacc 
cacagggatg 
ccttcagagc 
gatctgactt 
tgaaatgggg 
tggggtatga 
atagttggac 
tttacccagg 
cagacatagt 
taagagaacc 
agaaacaggg 
aaacagggaa 
aggcagtgca 
gactacccat 
ggatccctga 
aaaaagatcc 
ctaaaatagg 
ctaacacaac 
gatcagaagt 
cagataagag 
gaatctacct 
aattagtaag 
aagagcatga 
ccatagtagc 
tacatggaca 
gaaaaatcat 
cagcagaaac 
tcaaagtaat 
gttggtgggc 
tggtagaatc 
agcaccttaa 
gaattggggg 
ctaaagaatt 
gcagagaccc 
taatagaaga 
attatggaaa 
gcatggaata 
tacagacatc 
ggggatgcta 
catttgggtc 
cctgacctgg 
ataagacaag 
aagaaggtag 
aagccacctc 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
tgcctagtgt 
gcagagggaa 
tgtcagacac 
tggggatact 
tcatttcaga 
aaatggagcc 
cagcttgtaa 
caaaaggttt 
caagtggtga 
tggtaagttt 
tagcactaat 
taagacaaaa 
gcaatgagag 
ggcttctgga 
gtacctgtgt 
gagacagaag 
caagaaatag 
gatcagatgc 
ttgaccccac 
gttacaggta 
aaaaattgct 
ctcttttata 
ttaataaatt 
attcctatac 
ttcaatggga 
ccagtggtat 
agatctgaaa 
gagattaatt 
caagcattct 
agtacagata 
cctaataaaa 
agctttaatt 
tactacccta 
caatgcaaaa 
cctcccattg 
gatgggggat 
gatatgaggg 
ttgggaatag 
ggaataggag 
tcaataacgc 
aatttgctga 
aagcagctcc 
gggatttggg 
tggagtaata 
gaaattaata 
gaaaagaatg 
gacatatcaa 
ggtttaagaa 
cctttgtcat 
gaagaagaag 
tcgcttgcct 
ttcatattaa 
cagagggggt 
ctaaaaaaga 
gataggatta 
ataagacagg 
catagttgga 



tagaaaatta 
ccatacaatg 
tttcctagac 
tggacgggag 
attggatgcc 
agtagatcct 
taattgcttt 
aggcatttcc 
agatcatcaa 
aagtttattt 
catagcaata 
gaaaatagac 
tgatggggac 
tgctaatgat 
ggagagaagc 
tgcataatgt 
ttttgggaaa 
atgaggatat 
tctgtgtcac 
atacaaatga 
ctttcaatgc 
aacttgatat 
gcaatacctc 
attactgtgc 
caggaccatg 
caactcaact 
atttgacaga 
gtacaaggcc 
atgcaacaaa 
gatggaataa 
caataaaatt 
gtagaggaga 
agaatggtac 
taaaacaaat 
caggaaacat 
ttaacaacac 
ataactggag 
cacccactaa 
ctgtgttcct 
tgacggtaca 
aggctataga 
aggcgagagt 
gctgctctgg 
aatctgaagc 
attacacaga 
aaaaagattt 
actggctgtg 
taatttttgc 
ttcagaccct 
gtggagagca 
gggacgatct 
ttgcagtgag 
gggagatcct 
gtgctattag 
tagaattggt 
gctttgaagc 
tggcctgcag 



gtagaggata 
aatggacact 
catggctcca 
ttgaagctat 
aacatagcag 
aaactaaagc 
tgcaaacact 
tatggcagga 
aatcctctat 
aaaggagtag 
atagtgtgga 
tggttaatta 
acagaagaat 
ttgtaacacg 
aaaaactact 
ctgggctaca 
tgtaacagaa 
aatcagttta 
tttaaactgt 
taccaatatt 
aaccacagaa 
agtaccactt 
aaccataaca 
tccagctgat 
ttataatgtc 
actgttaaat 
gaataccaaa 
caacaataat 
tgacgtaata 
aactttacaa 
tgaaccacat 
atttttctat 
atacaaatac 
tgtacgcatg 
aacatgtaga 
aaacaacgac 
aagtgaatta 
ggcaaaaaga 
tgggttcttg 
ggccagacaa 
ggcgcaacag 
cctggctata 
aagactcatc 
agatatttgg 
aacaatattc 
attagaattg 
gtatataaaa 
tgtgctctct 
taccccaagc 
agacagagac 
gcggagcctg 
ggcagtggaa 
taagtatctg 
tccgcttgat 
acaaagaatt 
agctttgcta 
taagagaaag 



gatggaacga 
agagattcta 
tagcttagga 
aataagagta 
aataggcatc 
cctggaacca 
gtagctatca 
agaagcggag 
caaagcagta 
attatagatt 
ccatagcata 
aaagaattag 
tgtcaacaat 
gaggacttgt 
ctattctgtg 
catgcttgtg 
aattttaata 
tgggatcaaa 
acagatacaa 
gcaaatgcta 
ttaagagata 
aatgaaaata 
caagcctgtc 
tatgcgattc 
agcacagtac 
ggtagtctag 
acaataatag 
acaaggaaaa 
ggaaacataa 
caggtaatga 
gcaggagggg 
tgcaatacat 
aatggtaatt 
tggcaagggg 
tcaaacatca 
acagaggaga 
tataaatata 
agagtggtgc 
ggagcagcag 
ctgttgtctg 
catatgttgc 
gaaagatacc 
tgcaccactg 
gataacatga 
aggttgcttg 
gacaagtgga 
atattcataa 
atagtgaata 
ccgaggggac 
agatccatac 
tgcctcttca 
cttctgggac 
ggaagtcttg 
accatagcaa 
tgtagagcta 
taaaatggga 
aatgagaaga 



cccccagaag 
gaagaactca 
caatatatct 
ctgcaacaac 
ttgcgacaga 
tccaggaagc 
ttgtctagtt 
acagcgacga 
agtacacata 
aggagtagga 
tatagaatat 
ggaaagagca 
ggtggatatg 
gggtcacagt 
catcagatgc 
tacccacaga 
tgtggaaaaa 
gcctaaagcc 
atgttacagg 
catataagta 
agaaacataa 
gtaacaactt 
caaaggtctc 
taaagtgtaa 
aatgtacaca 
cagaagaagg 
tacatcttaa 
gtgtaaggat 
gacaagcaca 
aaaaattagg 
atctagaaat 
caaacctgtt 
caagcttacc 
taggacaagc 
caggaatact 
cattcagacc 
aagtggtaga 
agagaaaaaa 
gaagcactat 
gtatagtgca 
aactcacagt 
taaaggatca 
ctgtgccttg 
cttggatgca 
aagactcgca 
ataatctgtg 
tgatagtagg 
gagttaggca 
tcgacaggct 
gattggtgag 
gctaccaccg 
acagcagtct 
tgcagtattg 
tagcagtagc 
tcctcaacat 
ggcaagtggt 
actgagccag 



accaggggcc 
agcaggaagc 
atgaaaccta 
tactgttcat 
gaagagcaag 
caacctaaaa 
tgctttcaga 
agcgctcctc 
gtagatgtaa 
gcattgatag 
aggaaattgg 
gaagacagtg 
gggcatctta 
ctactatggg 
taaagcatat 
ccccaaccca 
taacatggca 
atgtgtaaag 
taatagaact 
tgaagaaatg 
agagtatgca 
tacatataga 
ttttgacccg 
taataagaca 
tggaattaag 
gataataatt 
tgaatctgta 
aggaccagga 
ttgtaacatt 
agagcatttc 
tacaatgcat 
taatagtaca 
catcacactc 
aatgtatgcc 
attgacacgt 
tggaggagga 
aattaagcca 
aagagcagtg 
gggcgcagcg 
acagcaaagc 
ctggggcatt 
acagctccta 
gaactccagt 
gtgggataga 
aaaccagcag 
gaattggttt 
aggcttgata 
gggatactca 
cggaggaatc 
cggattcttg 
cttgagagac 
caggggacta 
gggtctagag 
tgaaggaaca 
acctaggaga 
caaaacgcag 
cagcagaggg 



5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
agtaggagca 
tgctactaat 
tccagtcaga 
cttcttttta 
aatccttgat 
atcggggcca 
tgacccaagg 
gagccaacat 
tctagcacac 
cagaagggac 
gggacttggg 
gggtctctct 
ctgcttaggc 
tgtgactctg 



gcgtctcaag 
gaagcttgtg 
cctcaggtac 
aaagaaaagg 
ttgtgggtct 
ggggtccgat 
gaggtgaaag 
ggagcagagg 
agacacatgg 
tttccgcctg 
agtggtcacc 
cggtagacca 
ctcaataaag 
gtaactagag 



acttagatag 
cctggctgca 
ctttaagacc 
ggggactgga 
ataacacaca 
tcccactgac 
aggccaatga 
atgaagatag 
cccgcgagct 
ggactttcca 
ctcagatgct 
gatctgagcc 
cttgccttga 
atccctcaga 



acatggggca 
agcacaagag 
aatgacttat 
agggttaatt 
aggcttcttc 
ctttggatgg 
aggagaagac 
agaagtatta 
acatccggag 
ctggggcgtt 
gcatataagc 
tgggagctct 
gtgctctaag 
ccctttgtgg 



cttacaagca 
gaggacggag 
aagagtgcag 
tactctagga 
cctgattggc 
tgcttcaagc 
aactgtttgc 
aagtggaagt 
tattacaaag 
ccgggaggtg 
agctgctttt 
ctggctatct 
tagtgtgtgc 
tagtgtggaa 



gcaacacacc 
atgtaggctt 
tagatctcag 
aaaggcaaga 
aaaactacac 
tagtaccagt 
tacaccctat 
ttgacagcct 
actgctgaca 
tggtctgggc 
cgcttgtact 
agggaaccca 
ccatctgttg 
aatctctagc 



9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9781 



<210> 34 
<211> 203 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 34 

gctgaggcaa tgagccaagc aaccagcgca aacatactga tgcagagaag caatttcaaa 60 
ggccctaaaa gaattattaa atgtttcaac tgtggcaagg aagggcacat agctagaaat 120 
tgtagggccc ctaggaaaaa aggctgttgg aaatgtggaa aggaaggaca ccaaatgaaa 180 
gactgtactg agaggcaggc taa 203 



<210> 35 
<211> 2151 
<212> DNA 

<213> Human immunodeficiency virus 



<400> 35 

ttttttaggg aagatttggc cttcccacaa gggaaggcca gggaatttcc ttcagaacag 60 
aacagagcca acagccccac cagcagagag cttcaagttc gaggagacaa cccccgctcc 120 
gaagcaggag ccgaaagaca gggaaccctt aatttccctc aaatcactct ttggcagcga 180 
ccccttgtct caataaaagt agggggtcaa ataaaggagg ctctcttaga cacaggagct 240 
gatgatacag tattagaaga aatgagtttg ccaggaaaat ggaaaccaaa aatgatagga 300 
ggaattggag gttttatcaa agtaagacag tatgatcaaa tacttataga aatttgtgga 360 
aaaaaggcta taggtacagt attaatagga cctacacctg tcaacataat tggaaggaat 420 
atgttgactc agcttggatg cacactaaat tttccaatta gtcccattga aactgtgcca 480 
gtaaaattaa agccaggaat ggatggccca aaggttaaac aatggccatt gacagaagag 540 
aaaataaaag cattaacagc aatttgtgaa gaaatggaga aagaaggaaa aattacaaaa 600 
attgggcctg aaaatccata taacactcca gtatttgcca taaaaaagaa ggacagtact 660 
aagtggagaa agttagtaga tttcagggaa cttaataaaa gaactcaaga cttttgggaa 720 
gttcaattag gaataccaca cccagcaggg ttaaaaaaga aaaaatcagt gacagtactg 780 
gatgtggggg atgcatattt ttcagttcct ttagatgagg acttcaggaa atatactgca 840 
ttcaccatac ctagtataaa caatgaaaca ccagggatta gatatcaata taatgtgctt 900 
ccacagggat ggaaaggatc accatcaata ttccagagta gcatgacaaa aatcttagag 960 
ccctttagag caagaaatcc agaaatagtc atctatcaat atatggatga cttgtatgta 1020 
ggatctgact tagaaatagg gcaacataga gcaaaaatag aggagttaag aaaacatctg 1080 
ttaaggtggg gatttaccac accggacaag aaacatcaga aagaaccccc atttctttgg 1140 
atggggtatg aactccatcc tgacaaatgg acagtacagc ctatagagtt gccagaaaag 1200 
gaaagctgga ctgtcaatga tatacagaag ttagtgggaa aattaaattg ggccagtcag 1260 
atttacccag gaattaaagt aaggcaactt tgtaaactcc ttaggggggc caaagcacta 1320 
acagatatag taccactaac tgaagaagca gaattagaat tggcagagaa cagggaaatt 1380 
ctaagagaac cagtacatgg agtatattat gacccatcaa aagacttggt agctgaaata 1440 
cagaaacagg ggcatgacca atggacatat caaatttacc aagaaccatt caaaaacctg 1500 
aaaacaggga agtatgcaaa aatgaggact gcccacacta atgatgtaaa acagttaaca 1560 
gaggcagtgc aaaaaatagc tatggaaagc atagtaatat ggggaaagac tcctaaattt 1620 
agactaccca tccaaaaaga aacatgggag acatggtgga cagactattg gcaagccacc 1680 
tggattcctg agtgggagtt tgttaatacc cctcccttag taaaattatg gtaccagcta 1740 
gagaaagaac ccataatagg agcagaaact ttctatgtag atggagcagc taatagggaa 1800 
actaaaatag gaaaagcagg gtatgttact gacagaggaa ggcagaaaat tgtttctcta 1860 
acagaaacaa caaatcagaa gactgaatta caagcaattc agctagcttt gcaagattca 1920 
ggatcagaag taaacatagt aacagactca cagtatgcat taggaatcat tcaagcacaa 1980 
ccagataaga gtgaatcaga gttagtcaac caaataatag aacaattaat aaaaaaggaa 2040 
aaggtctacc tgtcatgggt accagcacat aaaggaattg gaggaaatga acaaatagat 2100 
aaattagtaa gtaagggaat caggaaagtg ctgtttctag atggaataga t 2151 



<210> 36 
<211> 54 
<212> DNA 

<213> Human immunodeficiency virus 



<400> 36 

ggcggcatcg tgatctacca gtacatggac gacctgtacg tgggcagcgg cggc 54 



<210> 37 
<211> 18 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 37 

Gly Gly Ile Val Ile Tyr Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser 
  1               5                  10                  15 



Gly Gly 



<210> 38 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S1FCSacTA 

<400> 38 

gtttcttgag ctctggaagg gttaatttac tccaagaa 38 

<210> 39 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S1FTSacTA 

<400> 39 

gtttcttgag ctctggaagg gttaatttac tctaagaa 38 






<210> 40 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S145RTSalTA 

<400> 40 

gtttcttgtc gacttgtcca tgtatggctt cccct 35 

<210> 41 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S145RCSalTA 

<400> 41 

gtttcttgtc gacttgtcca tgcatggctt ccct 34 

<210> 42 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S245FASalTA 

<400> 42 

gtttcttgtc gactgtagtc caggaatatg gcaattag 38 

<210> 43 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S245FGSalTA 

<400> 43 

gtttcttgtc gactgtagtc cagggatatg gcaattag 38 

<210> 44 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
S2FullNotTA 
<400> 44 

gtttcttgcg gccgctgcta gagattttcc acactacca 39 



<210> 45 
<211> 9738 
<212> DNA 

<213> Human immunodeficiency virus 



<400> 45 

tggaagggtt 

cacaaggcta 

tgacctttgg 

acggaggaga 

acaaagaagt 

agctacatcc 

tccactgggg 

tgctgcatat 

agcctgggag 

ttgagtgctc 

gagatccctc 

gacttgaaag 

gtgcactcgg 

gcggaggcta 

gacaaatggg 

ctagtatggg 

tcagacggat 

gaaattagat 

gtacgagaca 

aaaacacagc 

aacctccaag 

aaagtaatag 

gaaggagcca 

gccatgcaaa 

ccagtacatg 

gcaggaacta 

ccagtagggg 

tacagccctg 

gtagaccggt 

atgacagaca 

ttaggaccag 

agccacaaag 

atacagaaaa 

gaagggcaca 

aaggaaggac 

tggccttccc 

ccaccactag 

caggagccga 

ttgtctcaat 

atacagtact 

ttggaggttt 

gggctatagg 

tgactcagct 

aattaaagcc 

taaaagcatt 

ggcctgaaaa 



aatttactcc 
cttccctgat 
atggtgcttc 
agacaactgt 
gttacagtgg 
ggattattac 
cgttccaggg 
aagcagcggc 
ctctctgtct 
taagtagtgt 
agaccctttg 
cgaaagtgag 
caagaggcga 
gaaggagaga 
aaaaaattag 
caagcagaga 
gtagacaaat 
cattatttaa 
ccaaggaagc 
aggcggaagc 
ggcaaatggt 
aggagaaggc 
ccccacaaga 
tgttaaaaga 
cagggcctgt 
ctagtaccct 
acatctataa 
tcagcatttt 
tcttcaaaac 
ccttgttagt 
gggcttcatt 
caagagtttt 
gcaattttaa 
tagccaggaa 
accaaatgaa 
acaaggggag 
aaccaacagc 
aagacaggga 
aaaagtagcg 
agaagaaata 
tatcaaagta 
tacagtatta 
tggatgcaca 
aggaatggat 
aacagaaatt 
tccatataac 



aggaaaaggc 
tggcaaaact 
aagctagtgc 
ttgctacacc 
aagtttgaca 
aaagactgct 
ggagtggtct 
ttttcgcctg 
atctggggaa 
gtgcccatct 
tggtagtgtg 
accagagaag 
ggggggcgac 
gatgggtgcg 
gttacggcca 
gctggaaaga 
aataaaacag 
cacagtagca 
cttagacaag 
ggctgacaaa 
acaccaggcc 
ttttagccca 
tttaaacacc 
taccatcaat 
tgcaccaggc 
tcaagaacaa 
aaggtggata 
agacataaaa 
tttaagagct 
ccaaaatgcg 
agaagaaatg 
ggctgaggca 
aggccctaga 
ttgcagggcc 
agactgtact 
gccagggaat 
cccaccagca 
acctttaact 
ggccaaacaa 
aacttgccag 
agacagtatg 
gtaggaccta 
ctaaattttc 
ggcccaaagg 
tgtgaggaaa 
actccagtat 



aagagatcct 
acacaccggg 
cagttgaccc 
ctatgagcca 
gcagcctagc 
gacacagaag 
gggcgggact 
tactgggtct 
cccactgctt 
gttgtgtgac 
gaaaatctct 
atctctcgac 
tggtgagtac 
agagcgtcaa 
ggggggagaa 
tttgcagtta 
ctacaaccag 
actctctatt 
atagaggagg 
aaggtcagtc 
atatcaccta 
gaggtaatac 
atgttaaata 
gaggaggctg 
cagatgagag 
atagcatgga 
attctggggt 
caaggaccaa 
gaacaatcta 
aacccagatt 
atgacagcat 
atgagccaag 
agagctgtta 
cctaggaaaa 
gagaggcagg 
ttccttcaga 
gagagcttca 
tccctcaaat 
aggaggctct 
gaaaatggaa 
atcaaatact 
cacctgtcaa 
caattagccc 
ttaaacaatg 
tggagaagga 
ttgccataaa 



tgatttatgg 
accaggggtc 
aagggaagta 
gtatggaatg 
acgcagacac 
ggactttccg 
gggagtggcc 
ctctaggtag 
aggcctcaat 
tctggtaact 
agcagtggcg 
gcaggactcg 
gccaaaattt 
tattaagagg 
aacactatat 
accctggcct 
ctcttcagac 
gtgtacataa 
aacaaaacaa 
aaaattatcc 
gaaccttgaa 
ccatgtttac 
cagtgggggg 
cagaatggga 
aaccaagggg 
tgacaagtaa 
taaataaaat 
aggaaccctt 
cacaagaggt 
gtaagaccat 
gtcagggagt 
caaacaatac 
aatgtttcaa 
ggggctgttg 
ctaatttttt 
gcagaccaga 
agttcaagga 
cactctttgg 
tttagataca 
accaaaaatg 
tatagaaatt 
cataattgga 
cattgaaact 
gccattgaca 
aggaaaaatt 
gaagaaggac 



gtctatcaca 
agatatccac 
gaagaggcca 
gatgatgaac 
ctggcccgcg 
cctgggactt 
agccctcaga 
accagatccg 
aaagcttgcc 
ctggtaacta 
cccgaacagg 
gcttgctgaa 
tttttgacta 
gggaaaatta 
gctaaaacac 
tttagagaca 
aggaacagag 
agggatagat 
atgtcagcaa 
tatagtgcag 
tgcatgggta 
agcattatca 
acatcaagca 
taggttacat 
aagtgacata 
cccacctatc 
agtaagaatg 
tagagactat 
aaaaaattgg 
tttaagagca 
gggaggacct 
aagtgtaatg 
ctgtggcagg 
gaaatgtgga 
agggaaaatt 
gccaacagcc 
gactccgaag 
cagcgacccc 
ggagcagatg 
ataggaggaa 
tgtggaaaaa 
agaaatctgt 
gtaccagtaa 
gaagaaaaaa 
acaaaaattg 
agtacaaagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
ggagaaaatt 
aattaggaat 
tgggagatgc 
ccatacctag 
agggatggaa 
ttagaacaca 
ctgacttaga 
aatggggatt 
ggtatgaact 
gctggactgt 
acccagggat 
acatagtgcc 
aagaaccagt 
aacaggggaa 
caggaaagta 
cagtgcaaaa 
tacccatccc 
ttcctgagtg 
aagaacccat 
aaataggaaa 
aaacaacaaa 
cagaagtaaa 
ataagagtga 
tctacctatc 
tagtaagtag 
agcatgaaaa 
tagtagcaaa 
atggacaagt 
aaatcatcct 
cagaaacagg 
aagtaataca 
ggtgggcaga 
tagaatccat 
accttaagac 
ttggggggta 
aagaattaca 
gagaccctat 
tacaagataa 
atggaaaaca 
tggcacagtt 
agacatcatt 
gatgccaggt 
ttgggtcacg 
gacctgacag 
aggaaagcca 
aaggtaggat 
ccacctctgc 
aggggccgca 
aggaagctgt 
aaacctatgg 
tgtttattca 
gagcaagaaa 
cctaaaactg 
tttcagaaaa 
gctcctccaa 
atatgtaatg 
agtagcacta 



agtagatttc 
accacaccca 
atatttttca 
tataaacaat 
aggatcacca 
aaacccagaa 
aatagggcaa 
taccacacca 
ccatcctgac 
caatgatata 
taaagtaagg 
actgactgaa 
acatggagta 
tgaccaatgg 
tgcaaaaatg 
gataacccag 
aaaagaaaca 
ggagtttgtc 
agtaggggca 
agcagggtat 
tcagaagact 
catagtaaca 
atcagaatta 
atgggtacca 
tggaatcaga 
atatcacagc 
ggaaatagta 
cgactgtagt 
agtagcagtc 
acaagaaaca 
tacagataat 
tatccaacgg 
gaataaagaa 
agcagtacaa 
cagtgcaggg 
aaaacaaatt 
ttggaaagga 
tagtgatata 
gatggcaggt 
tagtaaagca 
atgaaagcag 
tagtaataaa 
gagtctccat 
accaactaat 
tactaggaca 
ctctacaata 
ctagtgttag 
gagggaacca 
cagacacttt 
agatacttgg 
tttcaggatt 
tggagccaat 
cttgtaatgg 
aaggcttagg 
gcaataaaga 
ttagatttaa 
atcatagcaa 



agggaactca 
gcagggttaa 
gtccctttag 
gaaacaccag 
gcaatattcc 
gtagttatct 
catagagcaa 
gacaagaaac 
aaatggacag 
cagaagttag 
caactgtgta 
gaagcagaat 
tattatgacc 
acatatcaaa 
aggactgccc 
gaaagcatag 
tgggagacat 
aatacccctc 
gaaactttct 
gtcactgaca 
gaattacaag 
gactcacagt 
gtcagtcaaa 
gcacataaag 
aaagtactgt 
aattggagag 
gccagctgtg 
ccaggaatat 
catgtagcca 
gcatacttta 
ggcagtaatt 
gaatttggaa 
ttaaagaaaa 
atggcagtat 
gagagaataa 
ataaaaattc 
ccagccaaac 
aaggtagtac 
gctgattgtg 
ccatatgtat 
acacccaaaa 
aacatattgg 
agaatggaga 
tcatatgcat 
gatagttagc 
cttggcactg 
gaaattagta 
tacaatgaat 
cctagaccat 
acaggagttg 
gggtgccatc 
agatcctaac 
gtgttactgt 
catttactat 
tcatcaagat 
ctgcaagaat 
taatagtgtg 



ataaaagaac 
aaaagaaaaa 
atgagagctt 
ggattagata 
agagtagcat 
atcaatatat 
aaatagagga 
atcagaaaga 
tacagcctat 
tgggaaagtt 
aactccttag 
tagaattggc 
catcaaaaga 
tttaccaaga 
acactaatga 
taatatgggg 
ggtggtcaga 
ccctagtaaa 
atgtagatgg 
aaggaaggca 
caattcagct 
atgcattagg 
taatagaaca 
gaattggagg 
ttctagatgg 
caatggctag 
ataaatgtca 
ggcaattaga 
gtggctacat 
tactaaaatt 
tcaccagtac 
ttccctacaa 
tcatagggca 
tcattcacaa 
tagacataat 
aaaattttcg 
tactctggaa 
caagaaggaa 
tggcaggtag 
gtttcgagga 
gtaagttcag 
ggtctgcaga 
ttgagaagat 
tattttgatt 
cctaagtgtg 
acagcattga 
gaggatagat 
ggacactaga 
ggctccataa 
aagcaataat 
atagcagaat 
ctagaaccct 
aaacgttgca 
ggcaggaaga 
cctctaccaa 
agattctaga 
gaccatagta 



tcaagacttt 
atcagtgaca 
cagaaaatat 
tcaatataat 
gacaagaatc 
ggatgactta 
gttaagagga 
acccccattt 
acagctgcca 
aaactgggca 
gggagccaaa 
tgagaacagg 
tttaatagct 
accatttaaa 
tgtgaaacag 
aaaaactcct 
ctattggcaa 
attgtggtac 
agcagccaat 
gaaagttgtt 
agctttgcag 
aatcattcaa 
gttgataaaa 
aaatgaacaa 
aatagataaa 
tgagtttaat 
gctaaaaggg 
ctgtacacat 
ggaagcagag 
agcaggaaga 
cgcagttaag 
tccccaaagt 
agtaagagat 
ttttaaaaga 
agcatcagac 
ggtttattac 
aggtgaaggg 
agcaaaaatc 
acaggatgaa 
gagctgatgg 
aagtacacat 
caggagaaag 
atagcacaca 
gttttgcaga 
actatcaagc 
taaaaccaaa 
ggaacaagcc 
gcttttagaa 
cttaggacaa 
aagaatcctg 
aggcattttg 
ggaaccatcc 
gctatcattg 
agcggagaca 
agcagtaagt 
ttaggaatag 
tatatagaat 



tgggaagtcc 
gtactggatg 
actgcattca 
gttcttccac 
ttagagccct 
tatgtaggat 
cacctattga 
ctttggatgg 
gaaaaggaga 
agtcagattt 
gcactaacag 
gaaattctaa 
gaaatacaga 
aatctgagaa 
ttagcagagg 
aaatttagac 
gccacctgga 
cagctggaaa 
agggaaacta 
tccttcactg 
gattcagggc 
gcacaaccag 
aaggaaaaag 
gtagacaaat 
gctcaagaag 
ctgccaccca 
gaagccatgc 
ttagaaggaa 
gttatcccag 
tggccagtca 
gcagcctgtt 
caaggagtag 
caagctgagc 
aaagggggga 
atacaaacta 
agagacagca 
gcagtagtaa 
attaaggact 
gattagaaca 
atggttctac 
cccattagga 
agcttggcat 
agtagaccct 
atctgccata 
aggacataac 
aaagataaag 
ccagaagacc 
gaactcaagc 
catatctatg 
caacaattac 
cgacagagaa 
aggaagtcag 
tctagtttgc 
gcgacgaagc 
accgaatagt 
gagcattgat 
ataggaaatt 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
ggtaaggcaa 
tggcaatgag 
taggcttttg 
gggtacctgt 
atgagaaaga 
cacaagaagt 
tggatcagat 
aattgacccc 
cctctaaaga 
agaaagaaaa 
ggaatattaa 
caaaagtctc 
taaaatgtaa 
aatgtacaca 
cagaagaaga 
tacatcttaa 
gtgtgagaat 
gacaagcaca 
aaaaattaca 
gggacctaga 
caacagacct 
ctaataatac 
aggaggtagg 
atattacagg 
tcagacctgg 
tggtagaaat 
gagaaaaaag 
gcactatggg 
tagtgcaaca 
tcacggtctg 
aggatcaaca 
tgctttggaa 
ggatgcagtg 
actcgcaaag 
atctgtggaa 
tagtaggagg 
ttaggcaggg 
acaggctcgg 
tagtgagcgg 
accaccgatt 
gtagtctcag 
aatattgggg 
cagtagctga 
gcaacgtacc 
aagtggtcaa 
aggtcagcag 
acaaccagca 
gaaggagaag 
gcagcaatag 
tccaagaaaa 
gattggcaaa 
ttcaagctag 
tgtttactac 
tggaagtttg 
tacaaagact 
gaggtgtggt 
tgcttttcgc 



aggaaaatag 
agcgaggggg 
gatgctaatg 
ggggagagaa 
agtgcataat 
gattttgggc 
gcaggaagat 
actctgtgtc 
catgaaaaat 
tgcacttttt 
caactataga 
gtttgaccca 
taataagaaa 
tggaattaag 
gataataatt 
tgaatctata 
aggaccagga 
ttgtaacatt 
agaactcttc 
aattactaca 
gtttaatagt 
agagcgcatc 
acgagcaatg 
actactatta 
aggaggagac 
taaaccatta 
agcagtagga 
cgcagcatca 
gcaaagtaat 
gggcattaag 
gctcctagga 
ctctagttgg 
ggatagggaa 
ccagcaggaa 
ttggtttagc 
cttgataggt 
atactcaccc 
aggaatcgaa 
attcttgaca 
gagagacttc 
gggactgcag 
tctagagtta 
aggaacagat 
tagaagaata 
aaagcagtat 
cagagggagt 
acacagccca 
taggctttcc 
atctcagctt 
ggcaagagat 
actacacacc 
agccagtcga 
accctatgag 
acagtacgct 
gctgacacag 
ctgggcggga 
ctgtactggg 



actggttagt 
atactgaaga 
atgtgtaatg 
gcaaaaacta 
gtctgggcta 
aatgtaacag 
ataatcagtt 
actttaaact 
tgctctttct 
tatagacttg 
ttaataaatt 
attcctatac 
ttcaatggaa 
ccagtggtat 
agatctgaaa 
gagattaaat 
caagcattct 
agtaaaaatg 
cctaatagta 
catagcttta 
acatacagta 
acactccaat 
tatgcccctc 
acacgtgatg 
atgagggaca 
ggagtagcac 
ataggagctg 
ataacgctga 
ttgctgaggg 
cagctccagg 
ctgtggggct 
agtaataaaa 
attagtaatt 
agaaatgaaa 
ataacaaatt 
ttaagaataa 
ttgtcattgc 
gaagaaggtg 
cttgcctggg 
atattaattg 
agggggtggg 
aaaaagagtg 
aggattctag 
agacagggct 
aattggatgg 
aggatcagcg 
caacaatgct 
agtcagacct 
ctttttaaaa 
ccttgatttg 
gggaccaggg 
tccaagggaa 
ccagcatgga 
agcacgcaga 
aagggacttt 
caggggagtg 
tctctctagg 



taaaaggatt 
attatcgaca 
tgaagggctt 
ctctattttg 
cacatgcctg 
aaaattttaa 
tatgggatca 
gtacaaatgc 
atgtaaccac 
atatagtacc 
gtaatacctc 
attattgtgc 
taggaccatg 
caactcaatt 
atctgacaaa 
gtacaagacc 
atgcaacagg 
aatggaatac 
cagggataaa 
attgtggagg 
atggtacatg 
gcagaataaa 
ccattgcagg 
gaggagataa 
attggagaag 
ccactgctgc 
tgttccttgg 
cggtacaggc 
ctatagaggc 
caagagtcct 
gctctggaaa 
ctcaaagtga 
acacaaacac 
aagatttact 
ggctgtggta 
tttttgctgt 
agacccttat 
gagagcaaga 
acgacctacg 
tagtgagagc 
gaacccttaa 
ctattaatct 
aattcataca 
tcgaagcagc 
cctgaagtaa 
tctcaagact 
gcttgcgcct 
caggtacctt 
gaaaaggggg 
tgggtttata 
gtcagatttc 
gtagaagagg 
atggaggatg 
cacatggccc 
ccgctgggac 
gtcagccctg 
tagaccagat 



agggaaagag 
ctggtggata 
gtgggtcaca 
tgcatcagat 
tgtacccaca 
catgtggaaa 
aagccttaag 
aactgttaac 
agaattaaga 
acttaataat 
agccataaca 
tccagctggt 
cgataatgtc 
actgttaaat 
caatgtcaaa 
tggcaataat 
agacataata 
aactttacaa 
atttgcacca 
agaatttttc 
cactaatggt 
acaaattata 
aaacataaca 
taatactgaa 
tgaattatat 
aaaaaggaga 
gttcttggga 
cagacaatta 
gcaacagcat 
ggctatagag 
actcatctgc 
tatttgggat 
aatatacagg 
agcattggac 
tataaaaata 
gctctctcta 
cccaaacccg 
cagcagcaga 
aagcctgtgc 
agtggaactt 
gtatttgggg 
gcttgatact 
aaacctttgt 
tttgcaataa 
gagaaagaat 
tagagaaaca 
ggctggaagc 
taagaccaat 
gactggaagg 
acacacaagg 
cactgacctt 
ccaatgaagg 
aagacagaga 
gcgagctaca 
tttccactgg 
agatgctgca 
ctgagcccgg 



cagaagacag 
tggggcatct 
gtctactacg 
gctaaagcat 
gaccccaacc 
aatgacatgg 
ccatgtgtaa 
tacaataata 
gataagaaaa 
aggaagaatg 
caagcctgtc 
tatgcgcctc 
agcacagtac 
ggtagcctag 
acaataatag 
acaagaaaga 
ggagatataa 
agggtaagtc 
cactcaggag 
tattgcaata 
acatgcatgt 
aacatgtggc 
tgtagatcaa 
acagagacat 
aaatacaagg 
gtggtggaga 
gcagcaggaa 
ttgtctggta 
atgttgcaac 
agatacctac 
accactaatg 
aacatgacct 
ttgcttgaag 
aggtggaaca 
ttcataatga 
gtaaatagag 
aggggacccg 
tccattcgat 
ctcttctgct 
ctgggacaca 
agtcttgtgc 
atagcaatag 
agaggtatcc 
aatggggggc 
cagacgaact 
tggggcactt 
gcaagaggag 
gacttataaa 
gttaatttac 
cttcttccct 
tggatggtac 
agaaaacaac 
agtattaaga 
tccggagtat 
ggcgttccag 
tataagcagc 
gagctctctg 



6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
gctatctagg gaacccactg cttaagcctc aataaagctt gccttgagtg ccttgagtag 9660 
tgtgtgcccg tctgttgtgt gactctggta actagagatc cctcagacca cttgtggtag 9720 
tgtggaaaat ctctagca 9738 



<210> 46 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: spacer 

<400> 46 
Gly Gly Gly Ser 
1 